NOPE: Novel Object Pose Estimation from a Single Image

About

The practicality of 3D object pose estimation remains limited for many applications due to the need for prior knowledge of a 3D model and a training period for new objects. To address this limitation, we propose an approach that takes a single image of a new object as input and predicts the relative pose of this object in new images without prior knowledge of the object's 3D model and without requiring training time for new objects and categories. We achieve this by training a model to directly predict discriminative embeddings for viewpoints surrounding the object. This prediction is done using a simple U-Net architecture with attention and conditioned on the desired pose, which yields extremely fast inference. We compare our approach to state-of-the-art methods and show it outperforms them both in terms of accuracy and robustness. Our source code is publicly available at https://github.com/nv-nguyen/nope

Van Nguyen Nguyen, Thibault Groueix, Yinlin Hu, Mathieu Salzmann, Vincent Lepetit• 2023

Related benchmarks

Task	Dataset	Result
6D Object Pose Estimation	LineMOD	Cam Accuracy2.5	60
Object Pose Estimation	LineMod (test)	ADD (0.1d) Error16.3	22
Object Pose Estimation	T-LESS (seen (obj. 1-18) and novel (obj. 19-30))	VSD Recall (Seen)49.3	11
6D Pose Estimation	occluded YCB-Video (test)	ADD-S86	8
Object Pose Estimation	ShapeNet Unseen Categories (test)	Acc@3087.1	7
6D Object Pose Estimation	General Inference Efficiency Benchmark (test)	Inference Time (s)20.15	6

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord