| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Vision-Language Navigation | TOUCHDOWN (test) | TC1,668 | 17 | |
| Vision-Language Navigation | TOUCHDOWN (dev) | Task Completion Rate (TC)1,948 | 17 | |
| Vision-and-Language Navigation | Touchdown Seen (test) | TC36.9 | 13 | |
| Vision-and-Language Navigation | Touchdown Unseen (test) | nDTW27 | 11 | |
| Vision-and-Language Navigation | Touchdown seen (dev) | SDTW28.3 | 9 | |
| Outdoor Vision-and-Language Navigation | Touchdown (test) | Task Completion Rate (TC)16.2 | 9 | |
| Outdoor Vision-and-Language Navigation | Touchdown (dev) | TC15 | 9 | |
| Vision-and-Language Navigation | Touchdown unseen (dev) | SDTW1.3 | 7 | |
| Instruction Generation | Touchdown (val) | BLEU30.6 | 3 |