| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Vision-Language Navigation | HA-VLN Unseen (val) | NE4.96 | 13 | |
| Human-Aware Vision-and-Language Navigation | HA-VLN (val seen) | Navigation Error (NE)0.62 | 7 | |
| Vision-Language Navigation | HA-VLN Retrained (val-seen) | NE (w/o human)3.93 | 7 | |
| Vision-Language Navigation | HA-VLN Seen (val) | NE7.23 | 6 | |
| Human-Aware Vision-and-Language Navigation | HA-VLN Unseen 1.0 (val) | Navigation Error (NE)0.67 | 5 | |
| Vision-Language Navigation | HA-VLN (val seen) | NE4.95 | 5 | |
| Vision-Language Navigation | HA-VLN Retrained (val-unseen) | Diff NE-4.2 | 2 |