| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Self-evaluation | CVBench | AUROC | 0.747 | 36 |
| Vision Understanding | CVBench-2D | Accuracy | 77.76 | 22 |
| Spatial Reasoning | CVBench | 2D Relationship Score | 96.31 | 15 |
| Vision | CVBench | CVBench Score | 77.2 | 13 |
| Vision-Language Reasoning | CVBench | Accuracy | 86.16 | 12 |
| Perception | CVBench (test) | Accuracy | 87.6 | 11 |
| Visual Perception | CVBench | 2D Score | 76.6 | 5 |
| Visual Question Answering | CVBench 3D | Accuracy | 80.9 | 4 |
| Visual Question Answering | CVBench 2D | Accuracy | 76.7 | 4 |
| 2D Vision Reasoning | CVBench 2D | Accuracy | 58.5 | 4 |
| Counting | CVBench | Counting Score | 73.8 | 3 |