Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration

About

Robotic learning for navigation in unfamiliar environments needs to provide policies for both task-oriented navigation (i.e., reaching a goal that the robot has located), and task-agnostic exploration (i.e., searching for a goal in a novel setting). Typically, these roles are handled by separate models, for example by using subgoal proposals, planning, or separate navigation strategies. In this paper, we describe how we can train a single unified diffusion policy to handle both goal-directed navigation and goal-agnostic exploration, with the latter providing the ability to search novel environments, and the former providing the ability to reach a user-specified goal once it has been located. We show that this unified policy results in better overall performance when navigating to visually indicated goals in novel environments, as compared to approaches that use subgoal proposals from generative models, or prior methods based on latent variable models. We instantiate our method by using a large-scale Transformer-based policy trained on data from multiple ground robots, with a diffusion model decoder to flexibly handle both goal-conditioned and goal-agnostic navigation. Our experiments, conducted on a real-world mobile robot platform, show effective navigation in unseen environments in comparison with five alternative methods, and demonstrate significant improvements in performance and lower collision rates, despite utilizing smaller models than state-of-the-art approaches. For more videos, code, and pre-trained model checkpoints, see https://general-navigation-models.github.io/nomad/

Ajay Sridhar, Dhruv Shah, Catherine Glossop, Sergey Levine• 2023

Related benchmarks

TaskDatasetResultRank
Object Goal NavigationHM3D
Success Rate25
55
Backward Visual Navigation (To Start)Gibson
SR11.2
48
Backward Visual Navigation (To Start)HM3D
SR10.3
48
Forward Visual Navigation (To End)Gibson
SR24.3
48
Any-Point Visual NavigationGibson
SR31.6
24
Any-Point Visual NavigationHM3D
SR23.5
24
Forward Visual Navigation (To End)HM3D
Success Rate (SR)25
24
Image-Goal NavigationMP3D (test)
Success Rate10.95
19
Instance Image-Goal NavigationHM3D v3 (val)
Success Rate (SR)16.8
15
Goal Conditioned Visual NavigationRECON
ATE1.93
11
Showing 10 of 43 rows

Other info

Follow for update