Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

One2Any: One-Reference 6D Pose Estimation for Any Object

About

6D object pose estimation remains challenging for many applications due to dependencies on complete 3D models, multi-view images, or training limited to specific object categories. These requirements make generalization to novel objects difficult for which neither 3D models nor multi-view images may be available. To address this, we propose a novel method One2Any that estimates the relative 6-degrees of freedom (DOF) object pose using only a single reference-single query RGB-D image, without prior knowledge of its 3D model, multi-view data, or category constraints. We treat object pose estimation as an encoding-decoding process, first, we obtain a comprehensive Reference Object Pose Embedding (ROPE) that encodes an object shape, orientation, and texture from a single reference view. Using this embedding, a U-Net-based pose decoding module produces Reference Object Coordinate (ROC) for new views, enabling fast and accurate pose estimation. This simple encoding-decoding framework allows our model to be trained on any pair-wise pose data, enabling large-scale training and demonstrating great scalability. Experiments on multiple benchmark datasets demonstrate that our model generalizes well to novel objects, achieving state-of-the-art accuracy and robustness even rivaling methods that require multi-view or CAD inputs, at a fraction of compute.

Mengya Liu, Siyuan Li, Ajad Chhatkuli, Prune Truong, Luc Van Gool, Federico Tombari• 2025

Related benchmarks

TaskDatasetResultRank
6D Object Pose EstimationLineMOD
Cam Accuracy72.7
60
6D Object Pose EstimationToyota-Light (TOYL) (test)
AR42
24
Object Pose EstimationLineMod (test)
ADD (0.1d) Error52.6
22
6D Pose TrackingYCB-Video (All Frames)
AUC (ADD)84.8
14
6D Object Pose EstimationREAL275
ADD(-S)41
11
6-DoF Pose EstimationLM--
9
6D Pose Estimationoccluded YCB-Video (test)
ADD-S93.7
8
6D Object Pose EstimationToyota-Light
Average Recall42
8
Relative Pose EstimationToyota-Light
ADD(-S)34.6
7
6D Object Pose EstimationGeneral Inference Efficiency Benchmark (test)
Inference Time (s)0.09
6
Showing 10 of 12 rows

Other info

Code

Follow for update