| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text-centric Reasoning | VideoThinkBench mini (test) | Average Score89 | 22 | |
| Vision-centric tasks | VideoThinkBench mini (test) | Average Score37.3 | 18 | |
| Video generation reasoning | VideoThinkBench (test) | Eyeballing-Point Accuracy44.7 | 4 |