Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ViPlanner: Visual Semantic Imperative Learning for Local Navigation

About

Real-time path planning in outdoor environments still challenges modern robotic systems due to differences in terrain traversability, diverse obstacles, and the necessity for fast decision-making. Established approaches have primarily focused on geometric navigation solutions, which work well for structured geometric obstacles but have limitations regarding the semantic interpretation of different terrain types and their affordances. Moreover, these methods fail to identify traversable geometric occurrences, such as stairs. To overcome these issues, we introduce ViPlanner, a learned local path planning approach that generates local plans based on geometric and semantic information. The system is trained using the Imperative Learning paradigm, for which the network weights are optimized end-to-end based on the planning task objective. This optimization uses a differentiable formulation of a semantic costmap, which enables the planner to distinguish between the traversability of different terrains and accurately identify obstacles. The semantic information is represented in 30 classes using an RGB colorspace that can effectively encode the multiple levels of traversability. We show that the planner can adapt to diverse real-world environments without requiring any real-world training. In fact, the planner is trained purely in simulation, enabling a highly scalable training data generation. Experimental results demonstrate resistance to noise, zero-shot sim-to-real transfer, and a decrease of 38.02% in terms of traversability cost compared to purely geometric-based approaches. Code and models are made publicly available: https://github.com/leggedrobotics/viplanner.

Pascal Roth, Julian Nubert, Fan Yang, Mayank Mittal, Marco Hutter• 2023

Related benchmarks

TaskDatasetResultRank
Point-Goal navigationInternScenes Home (test)
SR4.50e+3
15
Point-Goal navigationInternVLA-N1 Commercial
Success Rate (SR)63.7
9
Point-Goal navigationInternScenes Commercial (test)
SR0.637
6
Point-Goal navigationInternScenes-Home Unseen v1.0 (test)
Success Rate40.2
6
Robot navigation3D-FRONT
Scene 1 Success Rate (SR)14
5
Vision-based NavigationReal-world Home Unitree Go2
Success Rate (SR)45
5
Visual NavigationNavDP Intern-Home
Success Rate (SR)45
5
Visual NavigationNavDP Intern-Commercial
Success Rate (SR)63.7
5
Visual NavigationNavDP Cluttered-Easy
Success Rate (SR)80.2
5
Visual NavigationNavDP Cluttered-Hard
Success Rate (SR)64.7
5
Showing 10 of 15 rows

Other info

Follow for update