FlowNav: Combining Flow Matching and Depth Priors for Efficient Navigation
About
Effective robot navigation in unseen environments is a challenging task that requires precise control actions at high frequencies. Recent advances have framed it as an image-goal-conditioned control problem, where the robot generates navigation actions using frontal RGB images. Current state-of-the-art methods in this area use diffusion policies to generate these control actions. Despite their promising results, these models are computationally expensive and suffer from weak perception. To address these limitations, we present FlowNav, a novel approach that uses a combination of CFM and depth priors from off-the-shelf foundation models to learn action policies for robot navigation. FlowNav is significantly more accurate and faster at navigation and exploration than state-of-the-art methods. We validate our contributions using real robot experiments in multiple environments, demonstrating improved navigation reliability and accuracy. Code and trained models are publicly available.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Point-Goal navigation | Citysim Outdoor Adaptation Task | SR65 | 5 | |
| Point-Goal navigation | Stanford 2D-3D-S Indoor Basic Task | SR0.9 | 5 | |
| Point-Goal navigation | Stanford 2D-3D-S Indoor (Adaptation Task) | Success Rate85 | 5 | |
| Point-Goal navigation | Citysim Outdoor Basic Task | SR (%)0.4 | 5 |