| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Spatial and Temporal Understanding | VSTI-Bench | Camera Movement Accuracy (MC)88.1 | 28 | |
| Spatial Reasoning | VSTI-Bench | Cam. Mov. Dir. Error24.4 | 17 | |
| Temporal spatial reasoning | VSTI-Bench (test) | Average Score46.8 | 13 | |
| Spatiotemporal Intelligence | VSTI-Bench | Accuracy77 | 6 | |
| 3D/4D Video Question Answering | VSTI-Bench | Accuracy59.1 | 5 | |
| Spatial and Temporal Understanding | VSTI-Bench Tiny | Average Score77 | 2 |