| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio-Visual Understanding | DailyOmni | Average Score82.71 | 83 | |
| Omni-modal reasoning | DailyOmni (test) | Score67.8 | 15 | |
| Omnimodal Reasoning | DailyOmni | Overall Accuracy69.9 | 13 | |
| Omni-modal Understanding | DailyOmni | Score80.2 | 11 | |
| Audio-Visual QA | DailyOmni | Accuracy55.56 | 6 | |
| Text Query QA | DailyOmni | Score84.6 | 3 |