EPOS: Estimating 6D Pose of Objects with Symmetries
About
We present a new method for estimating the 6D pose of rigid objects with available 3D models from a single RGB input image. The method is applicable to a broad range of objects, including challenging ones with global or partial symmetries. An object is represented by compact surface fragments which allow handling symmetries in a systematic manner. Correspondences between densely sampled pixels and the fragments are predicted using an encoder-decoder network. At each pixel, the network predicts: (i) the probability of each object's presence, (ii) the probability of the fragments given the object's presence, and (iii) the precise 3D location on each fragment. A data-dependent number of corresponding 3D locations is selected per pixel, and poses of possibly multiple object instances are estimated using a robust and efficient variant of the PnP-RANSAC algorithm. In the BOP Challenge 2019, the method outperforms all RGB and most RGB-D and D methods on the T-LESS and LM-O datasets. On the YCB-V dataset, it is superior to all competitors, with a large margin over the second-best RGB method. Source code is at: cmp.felk.cvut.cz/epos.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 6D Object Pose Estimation | BOP Core Datasets Challenge (test) | LM-O Score54.7 | 42 | |
| 6-DoF Pose Estimation | YCB-V BOP challenge 2020 | AR69.6 | 37 | |
| 6D Pose Estimation | Homebrewed BOP challenge (test) | Avg Recall58 | 20 | |
| 6D Pose Estimation | Occlusion dataset BOP challenge (test) | AR54.7 | 19 | |
| 6D Object Pose Estimation | BOP (T-LESS, ITODD, YCB-V, LM-O) Challenge (test) | LM-O Score54.7 | 13 | |
| 6D Pose Estimation | YCB-V | AR VSD62.6 | 5 | |
| 6-DoF Pose Estimation | BOP LINEMOD, Occlusion LINEMOD, YCB-Video | AR VSD (LMO)39 | 5 | |
| 6D Pose Estimation | LMO | AR VSD38.9 | 5 | |
| 6D Pose Estimation | LMO and YCB-V | Mean AR62.1 | 4 |