Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Plug-and-Play Label Map Diffusion for Universal Goal-Oriented Navigation

About

In embodied vision, Goal-Oriented Navigation (GON) requires robots to locate a specific goal within an unexplored environment. The primary challenge of GON arises from the need to construct a Bird's-Eye-View (BEV) map to understand the environment while simultaneously localizing an unobserved goal. Existing map-based methods typically employ self-centered semantic maps, often facing challenges such as reliance on complete maps or inconsistent semantic association. To this end, we propose Plug-and-Play Label Map Diffusion (PLMD), which defines a novel map completion diffusion model based on Denoising Diffusion Probabilistic Models (DDPM). PLMD generates obstacle and semantic labels for unobserved regions through a diffusion-based completion process, thereby enabling goal localization even in partially observed environments. Moreover, it mitigates inconsistent semantic association by leveraging structural consistency between known and unknown obstacle layouts and integrating obstacle priors into the semantic denoising process. By substituting predicted labels for unobserved regions, robots can accurately localize the specified objects. Extensive experiments demonstrate that PLMD \textbf{(I)} effectively expands the region of unknown maps, \textbf{(II)} integrates seamlessly into existing navigation strategies that rely on semantic maps, \textbf{(III)} achieves state-of-the-art performance on three GON tasks.

Zhixuan Shen, Yijie Zeng, Shengxiang Luo, Tianrui Li, Haonan Luo• 2026

Related benchmarks

TaskDatasetResultRank
Object NavigationMP3D (val)
Success Rate (SR)42.6
18
Object Goal NavigationHM3D v0.2 (val)
Success Rate (SR)66.5
10
ObjectNavHM3D v0.1 (val)
Success Rate (SR)65.6
9
Instance-ImageNavHM3D v0.2 (val)
Success Rate (SR)77.6
7
Multi-Robot ObjectNavHM3D v0.2 (val)
Success Rate (SR)76.2
5
Goal-oriented navigationHM3D and MP3D Open-vocabulary objects v0.2 (val)
Success Rate (SR)35.4
4
Multi-Robot ObjectNavMP3D (val)
Success Rate (SR)59.1
3
Showing 7 of 7 rows

Other info

Follow for update