Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation

About

The Vision-and-Language Navigation (VLN) task entails an agent following navigational instruction in photo-realistic unknown environments. This challenging task demands that the agent be aware of which instruction was completed, which instruction is needed next, which way to go, and its navigation progress towards the goal. In this paper, we introduce a self-monitoring agent with two complementary components: (1) visual-textual co-grounding module to locate the instruction completed in the past, the instruction required for the next action, and the next moving direction from surrounding images and (2) progress monitor to ensure the grounded instruction correctly reflects the navigation progress. We test our self-monitoring agent on a standard benchmark and analyze our proposed approach through a series of ablation studies that elucidate the contributions of the primary components. Using our proposed method, we set the new state of the art by a significant margin (8% absolute increase in success rate on the unseen test set). Code is available at https://github.com/chihyaoma/selfmonitoring-agent .

Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong• 2019

Related benchmarks

TaskDatasetResultRank
Vision-and-Language NavigationR2R (val unseen)
Success Rate (SR)47
260
Vision-and-Language NavigationREVERIE (val unseen)
SPL6.44
129
Vision-Language NavigationR2R (test unseen)
SR61
122
Vision-Language NavigationR2R (val seen)
Success Rate (SR)69
120
Vision-Language NavigationR2R Unseen (test)
SR61
116
Vision-and-Language NavigationRoom-to-Room (R2R) Unseen (val)
SR57
52
Vision-and-Language NavigationR2R (val seen)--
51
NavigationREVERIE Unseen (test)
SR5.8
43
Vision-and-Language NavigationR2R (test)
SPL (Success weighted Path Length)35
38
NavigationREVERIE (val unseen)
Success Rate (SR)8.15
34
Showing 10 of 30 rows

Other info

Code

Follow for update