
RGB-based Category-level Object Pose Estimation via Decoupled Metric Scale Recovery

About

While showing promising results, recent RGB-D camera-based category-level object pose estimation methods have restricted applications due to the heavy reliance on depth sensors. RGB-only methods provide an alternative to this problem yet suffer from inherent scale ambiguity stemming from monocular observations. In this paper, we propose a novel pipeline that decouples the 6D pose and size estimation to mitigate the influence of imperfect scales on rigid transformations. Specifically, we leverage a pre-trained monocular estimator to extract local geometric information, mainly facilitating the search for inlier 2D-3D correspondence. Meanwhile, a separate branch is designed to directly recover the metric scale of the object based on category-level statistics. Finally, we advocate using the RANSAC-P$n$P algorithm to robustly solve for 6D object pose. Extensive experiments have been conducted on both synthetic and real datasets, demonstrating the superior performance of our method over previous state-of-the-art RGB-based approaches, especially in terms of rotation accuracy. Code: https://github.com/goldoak/DMSR.
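The decoupling described above rests on a basic property of the pinhole camera model: scaling the object's 3D points and the translation by the same factor leaves every projected pixel unchanged, while the rotation is untouched. The sketch below illustrates only that property. It is not the authors' implementation, and the intrinsics, pose, points, and scale value are made-up numbers chosen for illustration.

```python
import math

def rotation_z(angle_rad):
    """Rotation matrix for a rotation about the camera z-axis."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def project(point, rotation, translation, focal=600.0, cx=320.0, cy=240.0):
    """Pinhole projection of an object-frame 3D point to pixel coordinates.

    The intrinsics (focal, cx, cy) are hypothetical values.
    """
    # Camera-frame point: R @ X + t
    cam = [
        sum(rotation[i][j] * point[j] for j in range(3)) + translation[i]
        for i in range(3)
    ]
    return (focal * cam[0] / cam[2] + cx, focal * cam[1] / cam[2] + cy)

def scaled(vec, s):
    """Multiply every component of a 3-vector by s."""
    return [s * v for v in vec]

# Hypothetical pose and normalized object points (assumptions for the demo).
R = rotation_z(math.radians(30.0))
t = [0.1, -0.05, 1.2]
points = [[0.5, 0.5, 0.5], [-0.5, 0.2, 0.1], [0.3, -0.4, -0.2], [0.0, 0.0, 0.5]]

# Hypothetical metric scale, standing in for a separately predicted value.
metric_scale = 0.25

pixels_normalized = [project(p, R, t) for p in points]
pixels_metric = [
    project(scaled(p, metric_scale), R, scaled(t, metric_scale)) for p in points
]

# Largest pixel difference between the two configurations.
max_diff = max(
    abs(a - b)
    for u, v in zip(pixels_normalized, pixels_metric)
    for a, b in zip(u, v)
)
print(max_diff)
```

Because both configurations produce the same 2D-3D correspondences up to scale, an error in the recovered scale changes the translation magnitude but not the rotation that a PnP solver would return. This is consistent with the abstract's emphasis on rotation accuracy.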

Jiaxin Wei, Xibin Song, Weizhe Liu, Laurent Kneip, Hongdong Li, Pan Ji • 2023

Related benchmarks

Task | Dataset | Result | Rank
Category-level 6D Pose Estimation | REAL275 (test) | Pose Acc (5°/5cm): 67.2 | 53
Category-level Object Pose Estimation | CAMERA25 67 (test) | NIOU@25: 74.4 | 5