| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| VideoMMMU | Video-Zero | Accuracy68.33 | 89 | 23h ago | |
| Video-Holmes | VideoChat-M1 | Accuracy60.5 | 83 | 7d ago | |
| Video-MMMU | Accuracy84.6 | 68 | 20d ago | ||
| VideoMathQA | CoPD | Accuracy55.76 | 61 | 19d ago | |
| MMVU | Accuracy75.8 | 57 | 19d ago | ||
| Video-MME | OmniJigsaw (CMM) | Overall Performance73.1 | 55 | 23h ago | |
| LongVideoReason | Video-Zero | Accuracy73.1 | 54 | 19d ago | |
| VSI-Bench | EvoVid | Accuracy43.1 | 51 | 23h ago | |
| VBVR-Bench Out-of-Domain | VBVR-Wan2.2 | Average Score61 | 39 | 21h ago | |
| MVBench | Triage | MVBench Score64.7 | 39 | 23h ago | |
| VBVR-Bench In-Domain | Average Score96 | 35 | 21h ago | ||
| Video-Holmes | Score46.7 | 34 | 2mo ago | ||
| VSI-Bench (test) | Video-ToC | Accuracy38.6 | 29 | 1mo ago | |
| SAGE-Bench 1.0 (test) | SAGE-Flash | Overall Score73.4 | 29 | 3mo ago | |
| VideoMathQA MCQ | DBTrimKV | Accuracy36.43 | 27 | 21d ago | |
| VideoMMMU comprehension | DBTrimKV | Accuracy61.33 | 27 | 21d ago | |
| VideoMMMU adaptation | DBTrimKV | Accuracy39 | 27 | 21d ago | |
| Seed-Bench R1 | APPO | Average Answer Score50.5 | 26 | 3mo ago | |
| MMVU Multiple-choice (test) | GPT-4o | Accuracy75.4 | 25 | 26d ago | |
| LVBench | Triage | LVBench Score43.3 | 24 | 3mo ago | |
| LongVideoBench | Triage | LongVideoBench Score59 | 24 | 20d ago | |
| EgoSchema (test) | CLiViS (InternVL3) | Accuracy69.4 | 23 | 8d ago | |
| Video-Holmes | VideoSeek | SR56.1 | 22 | 2mo ago | |
| MLVU (test) | OmniJigsaw (CMM) | Accuracy62.75 | 19 | 26d ago | |
| STAR | BoxTuning | Score67.7 | 19 | 1mo ago |