| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Long VideoBench (val) | Accuracy72.6 | 36 | 3d ago | ||
| MovieChat-1K Breakpoint Mode (test) | HierarQ | Accuracy76.4 | 24 | 3d ago | |
| MovieChat-1K Global Mode (test) | HierarQ | Accuracy87.5 | 24 | 3d ago | |
| MLVU | M-Avg77.3 | 22 | 3d ago | ||
| EgoSchema (full set) | Dispider | Accuracy55.6 | 17 | 3d ago | |
| Video-MME w/o subtitles | Accuracy0.818 | 14 | 3d ago | ||
| Video-MME (val) | Gemini-1.5-Pro | Accuracy75 | 12 | 3d ago | |
| MLVU multiple-choice questions | VideoXL | Accuracy0.649 | 12 | 3d ago | |
| MMBench-Video (val) | InternVL2.5-78B | Score1.97 | 11 | 3d ago | |
| TemporalBench | GPT-4o | Binary Accuracy73.2 | 9 | 3d ago | |
| TVQA Long | LLaVA-Video + OneClip-RAG | Overall Accuracy52.1 | 6 | 3d ago | |
| GLVC (test) | VideoDetective | Score69 | 6 | 3d ago | |
| QaEgo4D | LLaVA-Video + OneClip-RAG | Score1.71 | 5 | 3d ago |