Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions

About

We present a method that can recognize new objects and estimate their 3D pose in RGB images even under partial occlusions. Our method requires neither a training phase on these objects nor real images depicting them, only their CAD models. It relies on a small set of training objects to learn local object representations, which allow us to locally match the input image to a set of "templates", rendered images of the CAD models for the new objects. In contrast with the state-of-the-art methods, the new objects on which our method is applied can be very different from the training objects. As a result, we are the first to show generalization without retraining on the LINEMOD and Occlusion-LINEMOD datasets. Our analysis of the failure modes of previous template-based approaches further confirms the benefits of local features for template matching. We outperform the state-of-the-art template matching methods on the LINEMOD, Occlusion-LINEMOD and T-LESS datasets. Our source code and data are publicly available at https://github.com/nv-nguyen/template-pose

Van Nguyen Nguyen, Yinlin Hu, Yang Xiao, Mathieu Salzmann, Vincent Lepetit• 2022

Related benchmarks

TaskDatasetResultRank
Object Pose EstimationLINEMOD (LM) 1.0 (Split #1 Unseen)
Accuracy (Acc15)94.4
22
Object Pose EstimationT-LESS (seen (obj. 1-18) and novel (obj. 19-30))
VSD Recall (Seen)60.15
11
Object Pose EstimationLINEMOD (LM) 1.0 (Split #1 Seen)
Accuracy (Acc15)99.3
11
Object Pose EstimationLINEMOD 1.0 (Average Seen)
Acc@1599.1
11
Object Pose EstimationOccluded-LINEMOD (Split #1 Seen)
Accuracy (Acc15)77.3
11
Object Pose EstimationOccluded-LINEMOD (Split #1 Unseen)
Acc@1571.4
11
Object Pose EstimationOccluded-LINEMOD (Split #2 Seen)
Acc15 Accuracy84.1
11
Object Pose EstimationOccluded-LINEMOD Unseen (Split #2)
Acc@1572.7
11
Object Pose EstimationOccluded-LINEMOD (Split #3 Seen)
Accuracy (Acc15)76.8
11
Object Pose EstimationOccluded-LINEMOD (Split #3 Unseen)
Acc15 Accuracy85.3
11
Showing 10 of 12 rows

Other info

Follow for update