Co-op: Correspondence-based Novel Object Pose Estimation
About
We propose Co-op, a novel method for accurately and robustly estimating the 6DoF pose of objects unseen during training from a single RGB image. Our method requires only the CAD model of the target object and can precisely estimate its pose without any additional fine-tuning. While existing model-based methods suffer from inefficiency due to using a large number of templates, our method enables fast and accurate estimation with a small number of templates. This improvement is achieved by finding semi-dense correspondences between the input image and the pre-rendered templates. Our method achieves strong generalization performance by leveraging a hybrid representation that combines patch-level classification and offset regression. Additionally, our pose refinement model estimates probabilistic flow between the input image and the rendered image, refining the initial estimate to an accurate pose using a differentiable PnP layer. We demonstrate that our method not only estimates object poses rapidly but also outperforms existing methods by a large margin on the seven core datasets of the BOP Challenge, achieving state-of-the-art accuracy.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 6D Object Pose Estimation | BOP 7 core datasets: LM-O, T-LESS, TUD-L, IC-BIN, ITODD, HB, YCB-V 82 (test) | AR (LM-O)73 | 47 | |
| Object Pose Estimation | TUD-L BOP (test) | mAR64.2 | 23 | |
| 6D Object Pose Estimation | BOP challenge (test) | LM-O AR65.5 | 18 | |
| Multi-view 6D pose estimation | T-LESS BOP (test) | AR68.2 | 12 | |
| Multi-view 6D pose estimation | YCB-V BOP (test) | AR69.7 | 12 | |
| 6D Localization | T-LESS BOP (test) | AR59.2 | 9 | |
| 6D Localization | YCB-V BOP (test) | AR62.6 | 9 | |
| 6D Detection of Unseen Objects | T-LESS | AP0.62 | 6 | |
| 6D Detection of Unseen Objects | TUD-L | AP0.841 | 6 | |
| 6D Detection of Unseen Objects | YCB-V | AP80.8 | 6 |