Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion

About

A key technical challenge in performing 6D object pose estimation from RGB-D image is to fully leverage the two complementary data sources. Prior works either extract information from the RGB image and depth separately or use costly post-processing steps, limiting their performances in highly cluttered scenes and real-time applications. In this work, we present DenseFusion, a generic framework for estimating 6D pose of a set of known objects from RGB-D images. DenseFusion is a heterogeneous architecture that processes the two data sources individually and uses a novel dense fusion network to extract pixel-wise dense feature embedding, from which the pose is estimated. Furthermore, we integrate an end-to-end iterative pose refinement procedure that further improves the pose estimation while achieving near real-time inference. Our experiments show that our method outperforms state-of-the-art approaches in two datasets, YCB-Video and LineMOD. We also deploy our proposed method to a real robot to grasp and manipulate objects based on the estimated pose.

Chen Wang, Danfei Xu, Yuke Zhu, Roberto Mart\'in-Mart\'in, Cewu Lu, Li Fei-Fei, Silvio Savarese• 2019

Related benchmarks

TaskDatasetResultRank
6D Pose EstimationYCB-Video
AUC (ADD-S)0.964
151
6DoF Pose EstimationDTTD-Mobile
ADD-S AUC96.32
115
6DoF Pose EstimationYCB-Video (test)
2D Error < 2cm Rate100
72
6D Object Pose EstimationLineMOD--
60
6D Pose EstimationLineMod (test)
Ape92.3
29
6D Object Pose EstimationBOP challenge (test)
LM-O AR73
26
Object Pose EstimationLineMod (test)--
22
6D Object Pose EstimationT-LESS BOP challenge protocol PrimeSense (test)
VSD10
20
Object Pose EstimationLineMod (test)
APE92.3
18
6D Pose EstimationLineMOD
ADD (S)86.2
16
Showing 10 of 16 rows

Other info

Code

Follow for update