Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

About

Directly regressing all 6 degrees-of-freedom (6DoF) for the object pose (e.g. the 3D rotation and translation) in a cluttered environment from a single RGB image is a challenging problem. While end-to-end methods have recently demonstrated promising results at high efficiency, they are still inferior when compared with elaborate P$n$P/RANSAC-based approaches in terms of pose accuracy. In this work, we address this shortcoming by means of a novel reasoning about self-occlusion, in order to establish a two-layer representation for 3D objects which considerably enhances the accuracy of end-to-end 6D pose estimation. Our framework, named SO-Pose, takes a single RGB image as input and respectively generates 2D-3D correspondences as well as self-occlusion information harnessing a shared encoder and two separate decoders. Both outputs are then fused to directly regress the 6DoF pose parameters. Incorporating cross-layer consistencies that align correspondences, self-occlusion and 6D pose, we can further improve accuracy and robustness, surpassing or rivaling all other state-of-the-art approaches on various challenging datasets.

Yan Di, Fabian Manhardt, Gu Wang, Xiangyang Ji, Nassir Navab, Federico Tombari• 2021

Related benchmarks

TaskDatasetResultRank
6D Pose EstimationYCB-Video
AUC (ADD-S)90.9
148
6DoF Pose EstimationYCB-Video (test)--
72
6D Object Pose EstimationOccludedLINEMOD (test)
ADD(S)62.3
45
6D Object Pose EstimationLM-O (test)--
22
Object Pose EstimationLineMod (test)--
21
6D Object Pose EstimationBOP (T-LESS, ITODD, YCB-V, LM-O) Challenge (test)
LM-O Score61.3
13
6D Pose EstimationYCB-V
AR VSD65.2
5
6-DoF Pose EstimationBOP LINEMOD, Occlusion LINEMOD, YCB-Video
AR VSD (LMO)44
5
6D Pose EstimationLMO
AR VSD44.2
5
6D Pose EstimationLMO and YCB-V
Mean AR66.4
4
Showing 10 of 10 rows

Other info

Code

Follow for update