ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation

About

Establishing correspondences from image to 3D has long been a key task in 6DoF object pose estimation. To predict pose more accurately, deeply learned dense maps replaced sparse templates. Dense methods also improved pose estimation in the presence of occlusion. More recently, researchers have shown improvements by learning object fragments as segmentation. In this work, we present a discrete descriptor that can represent the object surface densely. By incorporating a hierarchical binary grouping, we can encode the object surface very efficiently. Moreover, we propose a coarse-to-fine training strategy, which enables fine-grained correspondence prediction. Finally, by matching predicted codes with the object surface and using a PnP solver, we estimate the 6DoF pose. Results on the public LM-O and YCB-V datasets show major improvement over the state of the art with respect to the ADD(-S) metric, even surpassing RGB-D based methods in some cases.
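
To make the hierarchical binary grouping more concrete, here is a minimal sketch of how such a surface code could be built. It is an illustration under stated assumptions, not the authors' implementation: the splitting rule (a median cut along the axis of largest extent) is a simple stand-in chosen for brevity, and the names `encode_surface` and `code_centroids` are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float, float]


def encode_surface(vertices: Sequence[Point], num_bits: int) -> List[int]:
    """Assign every vertex an integer code of `num_bits` bits.

    At each level, every current group of vertices is cut into two halves of
    (nearly) equal size, and one bit is appended to each vertex's code to
    record which half it fell into. The first bits therefore describe coarse
    surface regions and the later bits refine them.

    Assumption: the split is a median cut along the axis of largest extent,
    used here only as a simple stand-in for a grouping rule.
    """
    codes = [0] * len(vertices)
    groups = [list(range(len(vertices)))]
    for _ in range(num_bits):
        next_groups = []
        for group in groups:
            # Pick the coordinate axis along which this group is widest.
            axis = max(
                range(3),
                key=lambda a: max(vertices[i][a] for i in group)
                - min(vertices[i][a] for i in group),
            )
            ordered = sorted(group, key=lambda i: vertices[i][axis])
            half = len(ordered) // 2
            lower, upper = ordered[:half], ordered[half:]
            for i in group:
                codes[i] <<= 1  # append a 0 bit to every vertex in the group
            for i in upper:
                codes[i] |= 1  # flip the new bit to 1 for the upper half
            next_groups += [g for g in (lower, upper) if g]
        groups = next_groups
    return codes


def code_centroids(vertices: Sequence[Point], codes: Sequence[int]) -> Dict[int, Point]:
    """Map each code to the centroid of the vertices that carry it."""
    sums: Dict[int, List[float]] = {}
    for v, c in zip(vertices, codes):
        s = sums.setdefault(c, [0.0, 0.0, 0.0, 0.0])
        s[0] += v[0]
        s[1] += v[1]
        s[2] += v[2]
        s[3] += 1.0
    return {c: (s[0] / s[3], s[1] / s[3], s[2] / s[3]) for c, s in sums.items()}
```

In this sketch, the leading `k` bits of a code (`code >> (num_bits - k)`) identify the coarse group at level `k`, which is what makes a coarse-to-fine treatment of the bits possible. At inference time, a code predicted for a pixel could be looked up in the `code_centroids` table to obtain a 2D-3D correspondence, and the collected correspondences passed to a PnP solver such as OpenCV's `cv2.solvePnPRansac`.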

Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari • 2022

Related benchmarks

Task                      | Dataset                            | Result             | Rank
6D Pose Estimation        | YCB-Video                          | AUC (ADD-S) 90.1   | 148
6DoF Pose Estimation      | YCB-Video (test)                   | --                 | 72
6D Object Pose Estimation | BOP Core Datasets Challenge (test) | LM-O Score 75.2    | 42
6D Pose Estimation        | BOP challenge                      | LM-O 72.9          | 39
6D Object Pose Estimation | LM-O (test)                        | Recall (Mean) 76.9 | 22
6D Pose Estimation        | BOP Benchmark (test)               | LM-O Score 75.2    | 11
6D Object Pose Estimation | LineMOD-O                          | AR 72.1            | 7
