Nav-R1: Reasoning and Navigation in Embodied Scenes

About

Embodied navigation requires agents to integrate perception, reasoning, and action for robust interaction in complex 3D environments. Existing approaches face two problems: incoherent and unstable reasoning traces that hinder generalization across diverse environments, and difficulty balancing long-horizon semantic reasoning with low-latency control for real-time navigation. To address these challenges, we propose Nav-R1, an embodied foundation model that unifies reasoning in embodied environments. We first construct Nav-CoT-110K, a large-scale dataset of step-by-step Chains-of-Thought (CoT) for embodied tasks, which enables cold-start initialization with structured reasoning. Building on this foundation, we design a GRPO-based reinforcement learning framework with three complementary rewards (format, understanding, and navigation) that improve structural adherence, semantic grounding, and path fidelity. Furthermore, we introduce a Fast-in-Slow reasoning paradigm that decouples deliberate semantic reasoning from low-latency reactive control for efficient yet coherent navigation. Extensive evaluations on embodied AI benchmarks demonstrate that Nav-R1 consistently outperforms strong baselines, with over 8% average improvement in reasoning and navigation performance. Real-world deployment on a mobile robot further validates its robustness under limited onboard resources. Code: https://github.com/AIGeeksGroup/Nav-R1. Website: https://aigeeksgroup.github.io/Nav-R1.
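
To make the reward design in the abstract concrete, the following is a minimal sketch of how three reward terms could feed a GRPO-style group-relative advantage. It is not taken from the Nav-R1 code: the function names, the equal weights, and the sample reward values are all assumptions for illustration. Only the group normalization itself (subtract the group mean, divide by the group standard deviation) follows the standard GRPO formulation.

```python
from statistics import mean, pstdev


def combined_reward(format_r, understanding_r, navigation_r,
                    weights=(1.0, 1.0, 1.0)):
    """Weighted sum of the three reward terms named in the abstract.

    Equal weights are an assumption; the abstract does not say how the
    terms are combined.
    """
    w_f, w_u, w_n = weights
    return w_f * format_r + w_u * understanding_r + w_n * navigation_r


def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantage: normalize each reward within its sampled group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std over the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical (format, understanding, navigation) scores for three
# responses sampled for the same prompt.
group = [(1.0, 0.8, 0.6), (1.0, 0.4, 0.2), (0.0, 0.5, 0.9)]
rewards = [combined_reward(*scores) for scores in group]
advantages = group_relative_advantages(rewards)
```

Responses that score above the group mean receive a positive advantage and the rest a negative one, so no separate value network is needed to provide the baseline.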

Qingxiang Liu, Ting Huang, Zeyu Zhang, Hao Tang • 2025

Related benchmarks

Task                                    Dataset                        Result    Rank
Object Goal Navigation                  HM3D-OVON Seen (val)           SR 58.4   44
Object Goal Navigation                  HM3D-OVON unseen (val)         SR 42.2   43
Object Goal Navigation                  HM3D-OVON Seen-Synonyms (val)  SR 48.1   35
Open-set ObjectGoal Navigation          HM3D-OVON unseen (val)         SR 42.2   28
Open-Vocabulary Object Goal Navigation  HM3D-OVON (val-seen)           SR 58.4   21
Open-Vocabulary Object Goal Navigation  HM3D-OVON seen-syn (val)       SR 48.1   21
Open-Vocabulary Object Navigation       OVON unseen (val)              SR 42.2   10
Open-Vocabulary Object Navigation       OVON seen (val)                SR 58.4   9
Open-Vocabulary Object Navigation       OVON Synonyms (val)            SR 48.1   9
Open-Vocabulary Object Goal Navigation  HM3D OVON (test)               SR 42.2   7

SR: Success Rate.
