Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions
About
We present a method that can recognize new objects and estimate their 3D pose in RGB images even under partial occlusions. Our method requires neither a training phase on these objects nor real images depicting them, only their CAD models. It relies on a small set of training objects to learn local object representations, which allow us to locally match the input image to a set of "templates", rendered images of the CAD models for the new objects. In contrast with the state-of-the-art methods, the new objects on which our method is applied can be very different from the training objects. As a result, we are the first to show generalization without retraining on the LINEMOD and Occlusion-LINEMOD datasets. Our analysis of the failure modes of previous template-based approaches further confirms the benefits of local features for template matching. We outperform the state-of-the-art template matching methods on the LINEMOD, Occlusion-LINEMOD and T-LESS datasets. Our source code and data are publicly available at https://github.com/nv-nguyen/template-pose
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Pose Estimation | LINEMOD (LM) 1.0 (Split #1 Unseen) | Accuracy (Acc15)94.4 | 22 | |
| Object Pose Estimation | T-LESS (seen (obj. 1-18) and novel (obj. 19-30)) | VSD Recall (Seen)60.15 | 11 | |
| Object Pose Estimation | LINEMOD (LM) 1.0 (Split #1 Seen) | Accuracy (Acc15)99.3 | 11 | |
| Object Pose Estimation | LINEMOD 1.0 (Average Seen) | Acc@1599.1 | 11 | |
| Object Pose Estimation | Occluded-LINEMOD (Split #1 Seen) | Accuracy (Acc15)77.3 | 11 | |
| Object Pose Estimation | Occluded-LINEMOD (Split #1 Unseen) | Acc@1571.4 | 11 | |
| Object Pose Estimation | Occluded-LINEMOD (Split #2 Seen) | Acc15 Accuracy84.1 | 11 | |
| Object Pose Estimation | Occluded-LINEMOD Unseen (Split #2) | Acc@1572.7 | 11 | |
| Object Pose Estimation | Occluded-LINEMOD (Split #3 Seen) | Accuracy (Acc15)76.8 | 11 | |
| Object Pose Estimation | Occluded-LINEMOD (Split #3 Unseen) | Acc15 Accuracy85.3 | 11 |