| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Reasoning | Video-Holmes | Score46.7 | 20 | |
| Video Reasoning | Video-Holmes | Accuracy42.6 | 14 | |
| Video Understanding | Video-Holmes | Accuracy58.3 | 11 | |
| Video Question Answering | Video-Holmes | Average Score46.5 | 6 | |
| Audio-Visual Understanding | Video-Holmes | Score0.541 | 6 | |
| Audiovisual Understanding & Reasoning | Video-Holmes | Score59.2 | 4 |