Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Drive Anywhere with Model-Based Reannotation

About

Developing broadly generalizable visual navigation policies for robots is a significant challenge, primarily constrained by the availability of large-scale, diverse training data. While curated datasets collected by researchers offer high quality, their limited size restricts policy generalization. To overcome this, we explore leveraging abundant, passively collected data sources, including large volumes of crowd-sourced teleoperation data and unlabeled YouTube videos, despite their potential for lower quality or missing action labels. We propose Model-Based ReAnnotation (MBRA), a framework that utilizes a learned short-horizon, model-based expert model to relabel or generate high-quality actions for these passive datasets. This relabeled data is then distilled into LogoNav, a long-horizon navigation policy conditioned on visual goals or GPS waypoints. We demonstrate that LogoNav, trained using MBRA-processed data, achieves state-of-the-art performance, enabling robust navigation over distances exceeding 300 meters in previously unseen indoor and outdoor environments. Our extensive real-world evaluations, conducted across a fleet of robots (including quadrupeds) in six cities on three continents, validate the policy's ability to generalize and navigate effectively even amidst pedestrians in crowded settings.

Noriaki Hirose, Lydia Ignatova, Kyle Stachowicz, Catherine Glossop, Sergey Levine, Dhruv Shah• 2025

Related benchmarks

TaskDatasetResultRank
Open-loop trajectory predictionMM-CoS (test)
minADE1s0.617
11
Trajectory PredictionCoS Regular (test)
minADE (1s)0.617
8
NavigationCraftBench (test)
Success Rate (SR)35.6
6
Shared Control EvaluationPseudo-simulation 29 scenarios (test)
HO Rate15.1
6
NavigationWANDERLAND Indoor Scenes
Navigation Error (m)5.28
5
NavigationWANDERLAND Outdoor Scenes
Navigation Error (m)19.4
5
Open-loop trajectory predictionCoS-Recovery (test)
minADE (1s)2.586
4
Showing 7 of 7 rows

Other info

Follow for update