| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MVBench | LLaVA-Video-7B | Accuracy100 | 563 | 21h ago | |
| VideoMME | Gemini-1.5-Pro | Score (Overall)75 | 357 | 22h ago | |
| VideoMME | Overall Score100 | 222 | 15d ago | ||
| MLVU | Score78.19 | 221 | 1mo ago | ||
| MVBench (test) | FSR | Accuracy100.3 | 190 | 12d ago | |
| EgoSchema | FLoC | EgoSchema Score69.4 | 185 | 21h ago | |
| LongVideoBench | VisionZip† + DyTok (7B) | LongVideoBench Score59.2 | 123 | 22h ago | |
| MLVU | RETOOL-VIDEO | Accuracy81.5 | 114 | 13d ago | |
| Video-MME without subtitles | Overall Score84.8 | 108 | 22d ago | ||
| LVB | FLoC | Accuracy66.49 | 99 | 15d ago | |
| Video-MME | HiMu | Overall Score78.18 | 96 | 2mo ago | |
| Video-MME | Overall Score78.78 | 92 | 2mo ago | ||
| LongVideoBench, MLVU, and VideoMME Aggregate | InternVL3-8B + LiteFrame | Average Score65.7 | 84 | 15d ago | |
| MLVU | Accuracy87.34 | 80 | 2mo ago | ||
| MMVU | GPT-4o | Accuracy75.4 | 76 | 12d ago | |
| LVBench | Average Score73.5 | 75 | 13d ago | ||
| Aggregate MVBench, LongVideo Bench, MLVU, VideoMME | Qwen3-VL-8B-Instruct | Average Accuracy100 | 63 | 14d ago | |
| VideoMME | Accuracy (No Subtitles)65.1 | 60 | 1mo ago | ||
| VideoMMMU | Accuracy68.64 | 59 | 14d ago | ||
| Video-MME v1.0 (test) | TS-LLaVA | Score (Short)72.4 | 56 | 2mo ago | |
| LongVideoBench | GPT-4o | Accuracy66.7 | 56 | 14d ago | |
| TempCompass MCQ (test) | Accuracy82.8 | 55 | 2mo ago | ||
| EgoSchema (test) | Qwen2-VL-72B | Accuracy77.9 | 55 | 2mo ago | |
| Video-MME (test) | Accuracy88.6 | 51 | 2mo ago | ||
| VideoEvalPro (test) | Gemini 2.5 Pro | Accuracy0.784 | 50 | 2mo ago |