| Dataset Name | SOTA Method | Metric | Trend | Updated |
|---|---|---|---|---|
| MSRVTT-QA | Flash-VStream | Accuracy 72.4 | 491 | 3d ago |
| ActivityNet-QA | Penguin-VL | Accuracy 65.2 | 376 | 4d ago |
| MSRVTT-QA (test) | CLIPBERT | Accuracy 88.2 | 376 | 23d ago |
| MSVD-QA | Flash-VStream | Accuracy 80.3 | 360 | 3d ago |
| ActivityNet-QA (test) | | Accuracy 82.78 | 288 | 29d ago |
| MSVD-QA (test) | Video-QTR | Accuracy 87.8 | 279 | 23d ago |
| EgoSchema (Full) | Human Eval | Accuracy 75 | 221 | 12d ago |
| VideoMME | Gemini 2.5 Pro | Accuracy 85.1 | 210 | 3d ago |
| NExT-QA (test) | | Accuracy 86.3 | 204 | 1mo ago |
| LongVideoBench | Gemini-2.5-Pro | Accuracy 77.6 | 180 | 4d ago |
| NExT-QA (val) | | Overall Acc 88.4 | 176 | 1mo ago |
| EgoSchema | A4VL | Accuracy 82.2 | 161 | 4d ago |
| TGIF-QA | HiTeA | Accuracy 97.2 | 156 | 1mo ago |
| MSVD | LVLM | Accuracy 79.5 | 152 | 3d ago |
| MLVU | VideoChat-A1 | Accuracy 76.2 | 143 | 4d ago |
| VideoMMMU | Gemini 2.5 Pro | Accuracy 74.9 | 124 | 16d ago |
| EgoSchema subset | VideoHV-Agent | Accuracy 81 | 114 | 12d ago |
| NExT-QA Multi-choice | VideoLLaMA3-7B | Accuracy 84.5 | 114 | 29d ago |
| LVBench | HAVEN | Accuracy 84.1 | 108 | 4d ago |
| NEXT-QA | LinVT-Qwen2-VL | Overall Accuracy 85.5 | 105 | 1mo ago |
| MSRVTT | LVLM | Accuracy 66.7 | 100 | 3d ago |
| EgoSchema (test) | QwenVL2 | Accuracy 77.9 | 90 | 10d ago |
| MVBench | Gemini 1.5 Pro | Accuracy 81.3 | 90 | 1mo ago |
| TGIF-QA (test) | All-in-one-B * | Accuracy 95.5 | 89 | 1mo ago |
| NextQA | Penguin-VL | Accuracy 85.4 | 78 | 16d ago |