Semantic Prior Guided One-View 6D Pose Estimation for Novel Objects

About

In many practical 6D object pose estimation scenarios, we often have access to only a single real-world RGB-D reference view per object, typically without CAD models. Existing methods largely rely on explicit 3D models or multi-view data, which limits their scalability. To address this challenging single-reference model-free setting, we propose \textbf{OneViewAll}, a semantic-prior-guided framework that performs pose estimation via a novel Project-and-Compare paradigm. Instead of relying on computationally expensive CAD-based rendering, our method directly aligns reference and query observations within a projection-equivariant space. OneViewAll progressively integrates hierarchical semantic priors across three levels: (1) \textit{category- and scene-level} priors for efficient hypothesis initialization; (2) \textit{object-level symmetry} priors for geometry completion via mirror fusion; and (3) \textit{patch-level} priors for discriminative refinement. Extensive experiments demonstrate that OneViewAll achieves \textbf{92.5\%} ADD-0.1 accuracy on the LINEMOD dataset using only one real reference view -- significantly outperforming the CVPR 2025 baseline One2Any (52.6\%). It also yields consistent improvements on YCB-V, Real275, and Toyota-Light while maintaining low inference latency. Our results underscore the efficacy of symmetry-aware projection in handling symmetric, texture-less, and occluded objects.

Yang Luo, Yan Gong, Yongsheng Gao• 2026

Related benchmarks

Task	Dataset	Result
6D Pose Estimation	YCB-V	AUC (ADD-S)89.9	29
6-DoF Pose Estimation	BOP LM-O, TUD-L, IC-BIN, HB, YCB-V	LM-O Performance66.4	26
Object Pose Estimation	LineMod (test)	ADD (0.1d) Error89.9	22
6D Object Pose Estimation	LM-O (test)	--	22
6D Object Pose Estimation	REAL275	--	11
6D Object Pose Estimation	Toyota-Light	Average Recall56.4	8

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord