| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| DailyOmni | OmniAgent | Average Score82.71 | 83 | 15d ago | |
| WorldSense | Accuracy66.4 | 72 | 15d ago | ||
| Daily-Omni | OmniVideo-R1 | Accuracy82.8 | 58 | 15d ago | |
| Audio-visual understanding evaluation suite (WorldSense, Daily-Omni, OmniVideoBench, Video-MME, and LVOmniBench) (test) | WorldSense Score46.7 | 31 | 14d ago | ||
| IntentBench | OmniVideo-R1 | Accuracy74.2 | 20 | 2mo ago | |
| Video-MME | Score73.4 | 15 | 3mo ago | ||
| AVUT AV-Human | Accuracy0.7834 | 12 | 1d ago | ||
| Video-MME w/ audio | Ola | Accuracy68.4 | 10 | 3mo ago | |
| VideoHolmes | Accuracy67 | 10 | 3mo ago | ||
| AV-SpeakerBench | Score75.1 | 9 | 1mo ago | ||
| AVHBench | TAC-V | Overall Score81.7 | 8 | 3mo ago | |
| AVUT | Score85.6 | 8 | 1mo ago | ||
| Video-Holmes | Qwen3-Omni-Instruct | Score0.541 | 6 | 3mo ago | |
| JointAVBench | MiniCPM-o 4.5 | Overall Score60 | 3 | 1mo ago |