Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Iterative Vision-and-Language Navigation

About

We present Iterative Vision-and-Language Navigation (IVLN), a paradigm for evaluating language-guided agents navigating in a persistent environment over time. Existing Vision-and-Language Navigation (VLN) benchmarks erase the agent's memory at the beginning of every episode, testing the ability to perform cold-start navigation with no prior information. However, deployed robots occupy the same environment for long periods of time. The IVLN paradigm addresses this disparity by training and evaluating VLN agents that maintain memory across tours of scenes that consist of up to 100 ordered instruction-following Room-to-Room (R2R) episodes, each defined by an individual language instruction and a target path. We present discrete and continuous Iterative Room-to-Room (IR2R) benchmarks comprising about 400 tours each in 80 indoor scenes. We find that extending the implicit memory of high-performing transformer VLN agents is not sufficient for IVLN, but agents that build maps can benefit from environment persistence, motivating a renewed focus on map-building agents in VLN.

Jacob Krantz, Shurjo Banerjee, Wang Zhu, Jason Corso, Peter Anderson, Stefan Lee, Jesse Thomason• 2022

Related benchmarks

TaskDatasetResultRank
Vision-and-Language NavigationREVERIE (val unseen)
SPL19
173
Vision-and-Language NavigationREVERIE seen (val)
SR23
28
Vision-and-Language NavigationGSA-R2R N-Scene (test)
SR9.7
26
Vision-and-Language NavigationIR2R (val-seen)
Trajectory Length (TL)9.4
21
Vision-and-Language NavigationIR2R (val-unseen)
TL9.4
21
Iterative Vision-and-Language NavigationIR2R-CE (val seen)
TL7.2
15
Vision-Language NavigationGSA-R2R User Instructions Residential v1 (test)
SR14.7
12
Vision-Language NavigationGSA-R2R Basic Instructions Residential v1 (test)
SR14.9
12
Vision-Language NavigationGSA-R2R Basic Instructions Non-Residential v1 (test)
SR11
12
Vision-and-Language NavigationGSA-R2R N-Basic (test)
TL9.4
10
Showing 10 of 20 rows

Other info

Follow for update