Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scalable Unseen Objects 6-DoF Absolute Pose Estimation with Robotic Integration

About

Pose estimation-guided unseen object 6-DoF robotic manipulation is a key task in robotics. However, the scalability of current pose estimation methods to unseen objects remains a fundamental challenge, as they generally rely on CAD models or dense reference views of unseen objects, which are difficult to acquire, ultimately limit their scalability. In this paper, we introduce a novel task setup, referred to as SinRef-6D, which addresses 6-DoF absolute pose estimation for unseen objects using only a single pose-labeled reference RGB-D image captured during robotic manipulation. This setup is more scalable yet technically nontrivial due to large pose discrepancies and the limited geometric and spatial information contained in a single view. To address these issues, our key idea is to iteratively establish point-wise alignment in a common coordinate system with state space models (SSMs) as backbones. Specifically, to handle large pose discrepancies, we introduce an iterative object-space point-wise alignment strategy. Then, Point and RGB SSMs are proposed to capture long-range spatial dependencies from a single view, offering superior spatial modeling capability with linear complexity. Once pre-trained on synthetic data, SinRef-6D can estimate the 6-DoF absolute pose of an unseen object using only a single reference view. With the estimated pose, we further develop a hardware-software robotic system and integrate the proposed SinRef-6D into it in real-world settings. Extensive experiments on six benchmarks and in diverse real-world scenarios demonstrate that our SinRef-6D offers superior scalability. Additional robotic grasping experiments further validate the effectiveness of the developed robotic system. The code and robotic demos are available at https://paperreview99.github.io/SinRef-6DoF-Robotic.

Jian Liu, Wei Sun, Kai Zeng, Jin Zheng, Hui Yang, Hossein Rahmani, Ajmal Mian, Lin Wang• 2025

Related benchmarks

TaskDatasetResultRank
6D Pose EstimationYCB-V--
29
6-DoF Pose EstimationBOP LM-O, TUD-L, IC-BIN, HB, YCB-V
LM-O Performance62
26
Object Pose EstimationLineMod (test)
ADD (0.1d) Error90.2
22
6D Pose EstimationLineMOD
ADD-0.1D (ape)86.3
9
6D Object Pose EstimationYCB-V
Master Chef Can Error44.3
9
6-DoF Pose EstimationLM--
9
6D Object Pose EstimationLineMod 89
ADD-0.1D90.3
3
6D Object Pose EstimationYCB-V 94
ADD AUC52.8
3
6D Pose EstimationLM-O
Average Recall (AR)56.5
3
6D Pose EstimationTUD-L
AR77.4
2
Showing 10 of 10 rows

Other info

Follow for update