| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Embodied Reasoning and Question Answering | ERQA | Score65 | 30 | |
| Embodied reasoning | ERQA (test) | Accuracy70.25 | 12 | |
| Embodied Reasoning | ERQA | Accuracy54.5 | 11 | |
| Spatial Reasoning (Multi-Image) | ERQA | Accuracy42.2 | 8 | |
| Embodied Visual Question Answering | ERQA | Accuracy51.5 | 4 | |
| General | ERQA | Score41.6 | 4 |