| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Spatial Reasoning | VSI-Bench 1.0 (test) | Relative Distance Error25 | 37 | |
| Spatial Reasoning | VSI-Bench | Accuracy79.2 | 24 | |
| Spatial Reasoning | VSI-Bench Vanilla regime | Avg Score50.5 | 19 | |
| Spatial Reasoning | VSI-Bench tiny | Route Plan46.94 | 15 | |
| Spatial Reasoning (Video) | VSI-Bench | Accuracy68.3 | 14 | |
| Fine-grained video-based spatial reasoning | VSI-Bench | Avg Score60.6 | 13 | |
| Video Spatial Intelligence | VSI-Bench 123 (test) | Object Count70 | 13 | |
| Video Understanding | VSI-Bench | Accuracy49.5 | 11 | |
| multi-view Visual Question Answering | VSI-Bench (test) | Average Score52.9 | 11 | |
| Spatial Reasoning | VSI-Bench (test) | Avg Score44 | 4 |