| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Question Answering | Vript-ERO (VERO) | Accuracy40.3 | 16 | |
| Video Question Answering | Vript-RR (VRR) | Average Score85.3 | 16 | |
| Hallucination Evaluation | VRIPT-HAL (test) | F1 Score52.9 | 15 | |
| Video Question Answering | Vript RR | Scene Accuracy (M)0.921 | 14 | |
| Event Re-ordering | Vript ERO | Rank-1 Score81 | 8 |