OnePose: One-Shot Object Pose Estimation without CAD Models

About

We propose a new method named OnePose for object pose estimation. Unlike existing instance-level or category-level methods, OnePose does not rely on CAD models and can handle objects in arbitrary categories without instance- or category-specific network training. OnePose draws the idea from visual localization and only requires a simple RGB video scan of the object to build a sparse SfM model of the object. Then, this model is registered to new query images with a generic feature matching network. To mitigate the slow runtime of existing visual localization methods, we propose a new graph attention network that directly matches 2D interest points in the query image with the 3D points in the SfM model, resulting in efficient and robust pose estimation. Combined with a feature-based pose tracker, OnePose is able to stably detect and track 6D poses of everyday household objects in real-time. We also collected a large-scale dataset that consists of 450 sequences of 150 objects.

Jiaming Sun, Zihao Wang, Siyu Zhang, Xingyi He, Hongcheng Zhao, Guofeng Zhang, Xiaowei Zhou• 2022

Related benchmarks

Task	Dataset	Result
6D Object Pose Estimation	LineMOD	Cam Accuracy88.1	60
Object Pose Estimation	OnePose	Mean Pixel Error (2D)0.012	30
Object Pose Estimation	LineMod (test)	ADD (0.1d) Error63.6	22
6D Pose Estimation	LineMOD	ADD-0.1D (ape)11.8	9
Object Tracking	OnePose original (test)	Accuracy (1cm/1°)49.7	6
Object Tracking	OnePose Low Texture original (test)	Acc (1cm, 1°)12.4	6
Visual Localization	OnePose Large Objects (test)	Acc (1cm-1deg)47.1	4
Visual Localization	OnePose Medium Objects (test)	Acc (1cm-1deg)62.9	4
Visual Localization	OnePose Small Objects (test)	Accuracy (1cm-1deg)40.5	4

Showing 9 of 9 rows

Other info

Code

Follow for update

@wizwand_team Discord