| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Video Understanding | MotionBench (val) | Accuracy65.4 | 50 | |
| Temporal video understanding | MotionBench | Accuracy65.4 | 19 | |
| Motion Understanding | MotionBench | Accuracy62.3 | 16 | |
| Motion-level Perception | MotionBench (test) | Accuracy58 | 14 | |
| Motion-level Perception | MotionBench (dev) | Accuracy58 | 14 | |
| Video Motion Reasoning | MotionBench (dev) | Overall Accuracy63 | 10 | |
| Motion-Customized Video Generation | MotionBench | Motion Accuracy68.6 | 7 | |
| Video Motion Transfer | MotionBench | Text Similarity0.38 | 7 | |
| Customized Video Generation | MotionBench (User Study) | Motion Alignment78.3 | 5 | |
| Video Understanding | MotionBench | Overall Score70.6 | 4 | |
| Video Question Answering | MotionBench | ALL47.41 | 3 | |
| Short video understanding | MotionBench | Accuracy68.4 | 3 | |
| Video Captioning | MotionBench original (test) | Better-GPT52.3 | 2 |