Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments

About

We address the task of Vision-Language Navigation in Continuous Environments (VLN-CE) under the zero-shot setting. Zero-shot VLN-CE is particularly challenging due to the absence of expert demonstrations for training and minimal environment structural prior to guide navigation. To confront these challenges, we propose a Constraint-Aware Navigator (CA-Nav), which reframes zero-shot VLN-CE as a sequential, constraint-aware sub-instruction completion process. CA-Nav continuously translates sub-instructions into navigation plans using two core modules: the Constraint-Aware Sub-instruction Manager (CSM) and the Constraint-Aware Value Mapper (CVM). CSM defines the completion criteria for decomposed sub-instructions as constraints and tracks navigation progress by switching sub-instructions in a constraint-aware manner. CVM, guided by CSM's constraints, generates a value map on the fly and refines it using superpixel clustering to improve navigation stability. CA-Nav achieves the state-of-the-art performance on two VLN-CE benchmarks, surpassing the previous best method by 12 percent and 13 percent in Success Rate on the validation unseen splits of R2R-CE and RxR-CE, respectively. Moreover, CA-Nav demonstrates its effectiveness in real-world robot deployments across various indoor scenes and instructions.

Kehan Chen, Dong An, Yan Huang, Rongtao Xu, Yifei Su, Yonggen Ling, Ian Reid, Liang Wang• 2024

Related benchmarks

Task	Dataset	Result
Vision-Language Navigation	R2R-CE (val-unseen)	Success Rate (SR)25.3	779
Vision-Language Navigation	RxR-CE (val-unseen)	SR19	512
Vision-Language Navigation	VLN-CE R2R (val unseen)	Navigation Error (NE)7.58	76
Embodied Navigation	R2R-CE	Navigation Error (NE)7.58	19
Vision-and-Language Navigation	VLN-CE (test-unseen)	Navigation Error (NE)7.58	17
Vision-Language Navigation	VLN-CE unseen 100-episode MP3D (val)	Navigation Error (NE)7.58	15
Vision-Language Navigation	VLN-CE (test)	Navigation Error (NE)8.32	14

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord