| Visual Navigation (level-4) | VisuoThink | Pass@1 61.3 | | 15 | 3mo ago |
| Visual Navigation (level-3) | VisuoThink | Pass@1 93.8 | | 15 | 3mo ago |
| CitySim Outdoor | RSBM | MSE 2.4 | | 11 | 1mo ago |
| Custom Indoor | RSBM | MSE 1.72 | | 11 | 1mo ago |
| AI2-THOR Unseen Scenes (L >= 5) (test) | SAVN | SPL 13.91 | | 11 | 3mo ago |
| Visual Navigation level-5 | VisuoThink | Pass@1 5,320 | | 10 | 3mo ago |
| TIR-Bench Maze | Qwen3-VL + V-ABS | Accuracy 65 | | 9 | 21d ago |
| Frozen-Lake 8x8 Grid | Qwen3-VL + V-ABS | Accuracy 12.5 | | 9 | 21d ago |
| Frozen-Lake 6x6 Grid | GPT-4o + V-ABS | Accuracy 25 | | 9 | 21d ago |
| Frozen-Lake 4x4 Grid | Qwen3-VL + V-ABS | Accuracy 55 | | 9 | 21d ago |
| VisuoThink Level 5 | GPT-4o + V-ABS | Accuracy 69.7 | | 9 | 21d ago |
| VisuoThink (Level 4) | GPT-4o + V-ABS | Accuracy 74.1 | | 9 | 21d ago |
| VisuoThink Level 3 | GPT-4o + V-ABS | Accuracy 94.9 | | 9 | 21d ago |
| HM3D Shortcut v3 | IntentReact | Success Rate (SR) 60.95 | | 9 | 2mo ago |
| HM3D Alt Goal v3 | IntentReact | SR 34.26 | | 9 | 2mo ago |
| CAST (val) | PiJEPA | ATE XY RMSE (m) 1.65 | | 8 | 2mo ago |
| AI2-THOR L ≥ 5 (val) | MVV-IN (Segmentation + Region Proposal) | SPL 14.86 | | 8 | 3mo ago |
| AI2-THOR All (val) | MVV-IN (Segmentation + Region Feature) | SPL 0.1727 | | 8 | 3mo ago |
| AI2-THOR Unseen Scenes (All) (test) | SAVN | SPL 16.15 | | 7 | 3mo ago |
| Full GOAT-Bench Unseen (val) | GOAT-GTSem | SR 54.3 | | 6 | 2mo ago |
| Full GOAT-Bench Synonyms (val) | GOAT-GTSem | Success Rate (SR) 58.4 | | 6 | 2mo ago |
| Full GOAT-Bench Seen (val) | GOAT-GTSem | SR 56.7 | | 6 | 2mo ago |
| GOAT-Core (val) | GOAT-GT Sem* | SR 75 | | 6 | 2mo ago |
| Library Dynamic | ReaDy-Go | Success Rate 80 | | 6 | 3mo ago |
| Library Static | | SR 1 | | 6 | 3mo ago |