GNM: A General Navigation Model to Drive Any Robot

About

Learning provides a powerful tool for vision-based navigation, but the capabilities of learning-based policies are constrained by limited training data. If we could combine data from all available sources, including multiple kinds of robots, we could train more powerful navigation models. In this paper, we study how a general goal-conditioned model for vision-based navigation can be trained on data obtained from many distinct but structurally similar robots, and enable broad generalization across environments and embodiments. We analyze the necessary design decisions for effective data sharing across robots, including the use of temporal context and standardized action spaces, and demonstrate that an omnipolicy trained from heterogeneous datasets outperforms policies trained on any single dataset. We curate 60 hours of navigation trajectories from 6 distinct robots, and deploy the trained GNM on a range of new robots, including an underactuated quadrotor. We find that training on diverse data leads to robustness against degradation in sensing and actuation. Using a pre-trained navigation model with broad generalization capabilities can bootstrap applications on novel robots going forward, and we hope that the GNM represents a step in that direction. For more information on the datasets, code, and videos, please check out our project page https://sites.google.com/view/drive-any-robot.

Dhruv Shah, Ajay Sridhar, Arjun Bhorkar, Noriaki Hirose, Sergey Levine• 2022

Related benchmarks

Task	Dataset	Result
Image-Goal Navigation	MP3D (test)	Success Rate10.06	32
Goal Conditioned Visual Navigation	SCAND	ATE2.12	18
Trajectory Prediction	RECON (unseen)	L2 Error (m)1.754	17
Trajectory Prediction	GoStanford (unseen)	L2 Error (m)3.375	17
Goal Conditioned Visual Navigation	RECON	ATE1.87	16
Instance Image-Goal Navigation	HM3D v3 (val)	Success Rate (SR)11.4	15
Trajectory Prediction	SCAND	ATE2.12	12
Trajectory Prediction	RECON	ATE1.85	12
Goal Conditioned Visual Navigation	Go Stanford (evaluation)	Absolute Trajectory Error (ATE)1.11	12
Open-loop trajectory prediction	MM-CoS (test)	minADE1s0.594	11

Showing 10 of 68 rows

Other info

Follow for update

@wizwand_team Discord