Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images

About

In this paper, we present a generalizable model-free 6-DoF object pose estimator called Gen6D. Existing generalizable pose estimators either need high-quality object models or require additional depth maps or object masks in test time, which significantly limits their application scope. In contrast, our pose estimator only requires some posed images of the unseen object and is able to accurately predict the poses of the object in arbitrary environments. Gen6D consists of an object detector, a viewpoint selector and a pose refiner, all of which do not require the 3D object model and can generalize to unseen objects. Experiments show that Gen6D achieves state-of-the-art results on two model-free datasets: the MOPED dataset and a new GenMOP dataset collected by us. In addition, on the LINEMOD dataset, Gen6D achieves competitive results compared with instance-specific pose estimators. Project page: https://liuyuan-pal.github.io/Gen6D/.

Yuan Liu, Yilin Wen, Sida Peng, Cheng Lin, Xiaoxiao Long, Taku Komura, Wenping Wang• 2022

Related benchmarks

Task	Dataset	Result
6D Object Pose Estimation	LineMOD	Cam Accuracy60.7	60
6D Pose Estimation	LineMOD	ADD-0.1D (ape)77	9
Relative Rotation Estimation	OnePose++ POPE's Sampling (test)	Median Error17.78	8
Rotation Estimation	LINEMOD novel objects (test)	Acc @ 15° (benchvise)88.9	6
Rotation Estimation	LineMOD	Estimation Time (s)0.092	6
Rotation Estimation	LineMOD	Peak Memory (MB)705	5
Relative Rotation Estimation	LINEMOD POPE's Sampling (test)	Median Error44.86	4
Relative Rotation Estimation	YCB-Video POPE's Sampling (test)	Median Error (Deg)54.48	4
3D Reconstruction	FewSOL	Chamfer Distance (Pen)0.0046	3
6D Pose Estimation	MOPED	Chamfer Distance (Cheezit)8.70e+3	3

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord