| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Vision-and-Language Navigation | REVERIE (val unseen) | SPL | 53.2 | 129 |
| Navigation | REVERIE Unseen (test) | SR | 81.51 | 43 |
| Vision-and-Language Navigation | REVERIE unseen (test) | Success Rate (SR) | 57.72 | 40 |
| Navigation | REVERIE (val unseen) | Success Rate (SR) | 61 | 34 |
| Remote Grounding | REVERIE unseen (test) | RGS | 77.84 | 33 |
| Vision-and-Language Navigation | REVERIE seen (val) | SR | 78.64 | 28 |
| Remote Grounding | REVERIE unseen (val) | RGS | 34.51 | 22 |
| Vision-and-Language Navigation | REVERIE (test) | SPL | 3,606 | 17 |
| Remote Grounding | REVERIE (val seen) | RGS | 61 | 15 |
| Remote Embodied Visual Referencing | REVERIE unseen 1.0 (val) | RGS | 22.41 | 15 |
| Remote Embodied Visual Referencing | REVERIE 1.0 (val seen) | Succ. | 61.91 | 15 |
| Navigation | REVERIE (val seen) | Success Rate (SR) | 50.53 | 14 |
| Vision-and-Language Navigation | REVERIE Unseen Discrete (val) | OSR | 63.9 | 10 |
| Object Localization | REVERIE (test unseen) | RGS | 16.83 | 8 |
| Object Localization | REVERIE (val unseen) | RGS | 18.23 | 8 |
| Object Localization | REVERIE (val seen) | RGS | 32.75 | 8 |
| Vision-and-Language Navigation | REVERIE CE (val unseen) | NE | 6.5 | 8 |
| Remote Embodied Visual Referencing | REVERIE 1.0 (test unseen) | Succ. | 81.51 | 6 |
| Vision-and-Language Navigation | REVERIE-CE (val seen) | NE | 5.38 | 5 |