Co-op: Correspondence-based Novel Object Pose Estimation

About

We propose Co-op, a novel method for accurately and robustly estimating the 6DoF pose of objects unseen during training from a single RGB image. Our method requires only the CAD model of the target object and can precisely estimate its pose without any additional fine-tuning. While existing model-based methods suffer from inefficiency due to using a large number of templates, our method enables fast and accurate estimation with a small number of templates. This improvement is achieved by finding semi-dense correspondences between the input image and the pre-rendered templates. Our method achieves strong generalization performance by leveraging a hybrid representation that combines patch-level classification and offset regression. Additionally, our pose refinement model estimates probabilistic flow between the input image and the rendered image, refining the initial estimate to an accurate pose using a differentiable PnP layer. We demonstrate that our method not only estimates object poses rapidly but also outperforms existing methods by a large margin on the seven core datasets of the BOP Challenge, achieving state-of-the-art accuracy.

Sungphill Moon, Hyeontae Son, Dongcheol Hur, Sangwook Kim• 2025

Related benchmarks

Task	Dataset	Result
6D Object Pose Estimation	BOP 7 core datasets: LM-O, T-LESS, TUD-L, IC-BIN, ITODD, HB, YCB-V 82 (test)	AR (LM-O)73	47
6D Object Pose Estimation	BOP challenge (test)	LM-O AR65.5	26
Object Pose Estimation	TUD-L BOP (test)	mAR64.2	23
Multi-view 6D pose estimation	T-LESS BOP (test)	AR68.2	12
Multi-view 6D pose estimation	YCB-V BOP (test)	AR69.7	12
6D Localization	T-LESS BOP (test)	AR59.2	9
6D Localization	YCB-V BOP (test)	AR62.6	9
6D Detection of Unseen Objects	T-LESS	AP0.62	6
6D Detection of Unseen Objects	TUD-L	AP0.841	6
6D Detection of Unseen Objects	YCB-V	AP80.8	6

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord