| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Reasoning | MMVU mc | Score82.6 | 16 | |
| Multimodal Understanding | MMVU | Accuracy75.4 | 13 | |
| Video Understanding | MMVU | Direct Score32 | 10 | |
| Video Question Answering | MMVU mc | Accuracy75.4 | 9 | |
| Video Understanding | MMVU (val) | Score46 | 9 | |
| Multi-modal Video Understanding | MMVU | Score67 | 6 | |
| Video Question Answering | MMVU | M-Avg67.2 | 5 | |
| Video reasoning | MMVU | Accuracy75.8 | 3 |