Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-path Learning for Object Pose Estimation Across Domains

About

We introduce a scalable approach for object pose estimation trained on simulated RGB views of multiple 3D models together. We learn an encoding of object views that does not only describe an implicit orientation of all objects seen during training, but can also relate views of untrained objects. Our single-encoder-multi-decoder network is trained using a technique we denote "multi-path learning": While the encoder is shared by all objects, each decoder only reconstructs views of a single object. Consequently, views of different instances do not have to be separated in the latent space and can share common features. The resulting encoder generalizes well from synthetic to real data and across various instances, categories, model types and datasets. We systematically investigate the learned encodings, their generalization, and iterative refinement strategies on the ModelNet40 and T-LESS dataset. Despite training jointly on multiple objects, our 6D Object Detection pipeline achieves state-of-the-art results on T-LESS at much lower runtimes than competing approaches.

Martin Sundermeyer, Maximilian Durner, En Yen Puang, Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel• 2019

Related benchmarks

TaskDatasetResultRank
6-DoF Pose EstimationYCB-V BOP challenge 2020
AR28.9
37
Object Pose EstimationLINEMOD (LM) 1.0 (Split #1 Unseen)
Accuracy (Acc15)38.3
22
6D Object Pose EstimationT-LESS BOP challenge protocol PrimeSense (test)
VSD69.5
20
6D Pose EstimationHomebrewed BOP challenge (test)
Avg Recall29.3
20
6D Pose EstimationOcclusion dataset BOP challenge (test)
AR21.7
19
Object Pose EstimationT-LESS (seen (obj. 1-18) and novel (obj. 19-30))
VSD Recall (Seen)43.17
11
Object Pose EstimationLINEMOD (LM) 1.0 (Split #1 Seen)
Accuracy (Acc15)91.5
11
Object Pose EstimationLINEMOD 1.0 (Average Seen)
Acc@1588.9
11
Object Pose EstimationOccluded-LINEMOD (Split #1 Seen)
Accuracy (Acc15)31.3
11
Object Pose EstimationOccluded-LINEMOD (Split #1 Unseen)
Acc@1518.6
11
Showing 10 of 17 rows

Other info

Follow for update