| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Visual Reasoning | VSP | Accuracy | 83.7 | 17 |
| Multimodal Reasoning | VSP IID | Accuracy | 82.8 | 14 |
| Visual Understanding | VSP | Accuracy | 75.83 | 11 |
| Visual Reasoning | VSP-Super | Accuracy (Scale 16) | 100 | 10 |
| Visual Reasoning | VSP | Accuracy (Scale 3) | 100 | 10 |
| Visual Spatial Perception | VSP Total | Accuracy (Total) | 65.83 | 9 |
| Visual Spatial Perception | VSP Unseen | Accuracy (Level 7) | 43 | 9 |
| Visual Spatial Perception | VSP Seen | Accuracy (Level 3) | 95 | 9 |
| Visual Spatial Planning | VSP (test) | Average Accuracy | 99 | 9 |
| Visual Spatial Planning | VSP | Accuracy | 73.2 | 7 |