Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images
About
In this paper, we present a generalizable model-free 6-DoF object pose estimator called Gen6D. Existing generalizable pose estimators either need high-quality object models or require additional depth maps or object masks in test time, which significantly limits their application scope. In contrast, our pose estimator only requires some posed images of the unseen object and is able to accurately predict the poses of the object in arbitrary environments. Gen6D consists of an object detector, a viewpoint selector and a pose refiner, all of which do not require the 3D object model and can generalize to unseen objects. Experiments show that Gen6D achieves state-of-the-art results on two model-free datasets: the MOPED dataset and a new GenMOP dataset collected by us. In addition, on the LINEMOD dataset, Gen6D achieves competitive results compared with instance-specific pose estimators. Project page: https://liuyuan-pal.github.io/Gen6D/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 6D Object Pose Estimation | LineMOD | -- | 50 | |
| Relative Rotation Estimation | OnePose++ POPE's Sampling (test) | Median Error17.78 | 8 | |
| Rotation Estimation | LINEMOD novel objects (test) | Acc @ 15° (benchvise)88.9 | 6 | |
| Rotation Estimation | LineMOD | Estimation Time (s)0.092 | 6 | |
| Rotation Estimation | LineMOD | Peak Memory (MB)705 | 5 | |
| Relative Rotation Estimation | LINEMOD POPE's Sampling (test) | Median Error44.86 | 4 | |
| Relative Rotation Estimation | YCB-Video POPE's Sampling (test) | Median Error (Deg)54.48 | 4 | |
| 3D Reconstruction | FewSOL | Chamfer Distance (Pen)0.0046 | 3 | |
| 6D Pose Estimation | MOPED | Chamfer Distance (Cheezit)8.70e+3 | 3 |