| MSRVTT-QA | Flash-VStream | Accuracy72.4 | | 481 | 2d ago |
| MSRVTT-QA (test) | CLIPBERT | Accuracy88.2 | | 371 | 2d ago |
| MSVD-QA | Flash-VStream | Accuracy80.3 | | 340 | 2d ago |
| ActivityNet-QA | | Accuracy64.4 | | 319 | 2d ago |
| ActivityNet-QA (test) | | Accuracy82.78 | | 275 | 2d ago |
| MSVD-QA (test) | Video-QTR | Accuracy87.8 | | 274 | 2d ago |
| NExT-QA (test) | | Accuracy86.3 | | 204 | 2d ago |
| EgoSchema (Full) | Human Eval | Accuracy75 | | 193 | 3d ago |
| NExT-QA (val) | | Overall Acc88.4 | | 176 | 3d ago |
| TGIF-QA | HiTeA | Accuracy97.2 | | 147 | 2d ago |
| NEXT-QA | LinVT-Qwen2-VL | Overall Accuracy85.5 | | 105 | 3d ago |
| NExT-QA Multi-choice | LLaVA-Video | Accuracy83.2 | | 102 | 2d ago |
| MSVD | LVLM | Accuracy79.5 | | 100 | 2d ago |
| VideoMME | Gemini 2.5 Pro | Accuracy85.1 | | 99 | 3d ago |
| MVBench | Gemini 1.5 Pro | Accuracy81.3 | | 90 | 3d ago |
| TGIF-QA (test) | All-in-one-B * | Accuracy95.5 | | 89 | 2d ago |
| EgoSchema | | Accuracy77.2 | | 88 | 3d ago |
| EgoSchema (test) | QwenVL2 | Accuracy77.9 | | 80 | 2d ago |
| EgoSchema subset | VideoMultiAgents | Accuracy75.4 | | 73 | 3d ago |
| MSRVTT-MC | UMT-L | Accuracy97.7 | | 61 | 3d ago |
| Perception (test) | VideoChat-Flash | Test Accuracy75.6 | | 59 | 3d ago |
| ActivityNet (test) | LLaVA-OneVision | Accuracy62.3 | | 57 | 3d ago |
| MSVD-QA zero-shot (test) | LLaVA+FreeVA | Accuracy81.5 | | 56 | 3d ago |
| ActivityNet-QA zero-shot (test) | LinVT-Qwen2-VL | Accuracy60.1 | | 55 | 3d ago |
| MSRVTT-QA zero-shot (test) | LLaVA+FreeVA | Accuracy72.9 | | 55 | 3d ago |