OneViewAll: Semantic Prior Guided One-View 6D Pose Estimation for Novel Objects
About
In many practical 6D object pose estimation scenarios, we often have access to only a single real-world RGB-D reference view per object, typically without CAD models. Existing methods largely rely on explicit 3D models or multi-view data, which limits their scalability. To address this challenging single-reference model-free setting, we propose \textbf{OneViewAll}, a semantic-prior-guided framework that performs pose estimation via a novel Project-and-Compare paradigm. Instead of relying on computationally expensive CAD-based rendering, our method directly aligns reference and query observations within a projection-equivariant space. OneViewAll progressively integrates hierarchical semantic priors across three levels: (1) \textit{category- and scene-level} priors for efficient hypothesis initialization; (2) \textit{object-level symmetry} priors for geometry completion via mirror fusion; and (3) \textit{patch-level} priors for discriminative refinement. Extensive experiments demonstrate that OneViewAll achieves \textbf{92.5\%} ADD-0.1 accuracy on the LINEMOD dataset using only one real reference view -- significantly outperforming the CVPR 2025 baseline One2Any (52.6\%). It also yields consistent improvements on YCB-V, Real275, and Toyota-Light while maintaining low inference latency. Our results underscore the efficacy of symmetry-aware projection in handling symmetric, texture-less, and occluded objects.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 6D Pose Estimation | YCB-V | AUC (ADD-S)89.9 | 29 | |
| 6-DoF Pose Estimation | BOP LM-O, TUD-L, IC-BIN, HB, YCB-V | LM-O Performance66.4 | 26 | |
| Object Pose Estimation | LineMod (test) | ADD (0.1d) Error89.9 | 22 | |
| 6D Object Pose Estimation | LM-O (test) | -- | 22 | |
| 6D Object Pose Estimation | REAL275 | -- | 11 | |
| 6D Object Pose Estimation | Toyota-Light | Average Recall56.4 | 8 |