| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MSRVTT-QA | Flash-VStream | Accuracy72.4 | 505 | 1mo ago | |
| ActivityNet-QA | Qwen3-VL + CurEvo | Accuracy71.02 | 418 | 5d ago | |
| MSVD-QA | Flash-VStream | Accuracy80.3 | 393 | 1mo ago | |
| MSRVTT-QA (test) | CLIPBERT | Accuracy88.2 | 376 | 2mo ago | |
| ActivityNet-QA (test) | Accuracy82.78 | 288 | 2mo ago | ||
| MSVD-QA (test) | Video-QTR | Accuracy87.8 | 279 | 2mo ago | |
| VideoMME | Gemini 2.5 Pro | Accuracy85.1 | 251 | 21d ago | |
| EgoSchema (Full) | Human Eval | Accuracy75 | 241 | 12d ago | |
| LongVideoBench | Gemini-2.5-Pro | Accuracy77.6 | 210 | 7d ago | |
| NExT-QA (test) | Accuracy86.3 | 204 | 2mo ago | ||
| MLVU | VideoChat-A1 | Accuracy76.2 | 194 | 1d ago | |
| NExT-QA (val) | Overall Acc88.4 | 176 | 3mo ago | ||
| EgoSchema | A4VL | Accuracy82.2 | 161 | 1mo ago | |
| TGIF-QA | HiTeA | Accuracy97.2 | 156 | 2mo ago | |
| MSVD | LVLM | Accuracy79.5 | 152 | 1mo ago | |
| VideoMMMU | Gemini 2.5 Pro | Accuracy74.9 | 140 | 1mo ago | |
| EgoSchema subset | VideoHV-Agent | Accuracy81 | 124 | 21d ago | |
| NExT-QA Multi-choice | VideoLLaMA3-7B | Accuracy84.5 | 114 | 2mo ago | |
| LVBench | HAVEN | Accuracy84.1 | 108 | 1mo ago | |
| NEXT-QA | LinVT-Qwen2-VL | Overall Accuracy85.5 | 105 | 22d ago | |
| MSRVTT | LVLM | Accuracy66.7 | 100 | 1mo ago | |
| EgoSchema (test) | QwenVL2 | Accuracy77.9 | 90 | 1mo ago | |
| MVBench | Gemini 1.5 Pro | Accuracy81.3 | 90 | 3mo ago | |
| TGIF-QA (test) | All-in-one-B * | Accuracy95.5 | 89 | 3mo ago | |
| LongVideoBench (val) | HAVEN | Accuracy83 | 87 | 21d ago |