Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation

About

We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image. To handle the intra-class shape variation, we propose a deep network to reconstruct the 3D object model by explicitly modeling the deformation from a pre-learned categorical shape prior. Additionally, our network infers the dense correspondences between the depth observation of the object instance and the reconstructed 3D model to jointly estimate the 6D object pose and size. We design an autoencoder that trains on a collection of object models and compute the mean latent embedding for each category to learn the categorical shape priors. Extensive experiments on both synthetic and real-world datasets demonstrate that our approach significantly outperforms the state of the art. Our code is available at https://github.com/mentian/object-deformnet.

Meng Tian, Marcelo H Ang Jr, Gim Hee Lee• 2020

Related benchmarks

TaskDatasetResultRank
Category-level 6D Pose EstimationREAL275 (test)
Pose Acc (5°/5cm)21.6
53
6D Pose and Size EstimationREAL275
5°5cm0.216
50
Category-level 6D Object Pose EstimationREAL275
mAP (5°5cm)21.4
16
Category-level 6D Object Pose EstimationCamera
mAP (5°2cm)54.3
13
3D Object DetectionREAL275
mAP@IoU7527
12
Category-level Object Pose EstimationCamera
Success Rate (5° 2cm)54.3
12
Pose EstimationNOCS (test)
mAP IoU 5077.3
10
Pose EstimationNOCS REAL275 (test)
mAP (IoU=0.50)0.773
10
Category-level 9D Pose EstimationNOCS REAL275 (test)
mAP (5° 5cm)21.4
9
6D Object Pose EstimationCAMERA25
IoU@7583.1
9
Showing 10 of 22 rows

Other info

Follow for update