| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Spatial Reasoning | RoboSpatial | Overall Score 72.43 | 36 |
| Grounded Visual Question Answering | RoboSpatial-Home | Context Score 54.92 | 16 |
| Spatial VQA | RoboSpatial | Accuracy 86.18 | 14 |
| Spatial Reasoning | RoboSpatial (val) | Accuracy (RoboSpatial val) 66.7 | 12 |
| Spatial Reasoning | RoboSpatial | Accuracy 66.7 | 12 |
| Visual Reasoning | ROBOSPATIAL | Accuracy 69.5 | 10 |
| Robot Spatial Reasoning | ROBOSPATIAL | Accuracy 76.3 | 10 |
| Spatial Reasoning | RoboSpatial | Confidence 82.11 | 9 |
| Spatial Reasoning | Robospatial (ood) | Accuracy 70.2 | 8 |
| Spatial Affordance Prediction | RoboSpatial | SR 21.65 | 7 |
| Embodied Understanding | RoboSpatial-Home | Score 76.6 | 5 |
| Robotic Spatial Reasoning | RoboSpatial | mAP (Mask) 62.57 | 5 |
| Visual Question Answering | ROBOSPATIAL | Accuracy 69.5 | 4 |