Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DPOD: 6D Pose Object Detector and Refiner

About

In this paper we present a novel deep learning method for 3D object detection and 6D pose estimation from RGB images. Our method, named DPOD (Dense Pose Object Detector), estimates dense multi-class 2D-3D correspondence maps between an input image and available 3D models. Given the correspondences, a 6DoF pose is computed via PnP and RANSAC. An additional RGB pose refinement of the initial pose estimates is performed using a custom deep learning-based refinement scheme. Our results and comparison to a vast number of related works demonstrate that a large number of correspondences is beneficial for obtaining high-quality 6D poses both before and after refinement. Unlike other methods that mainly use real data for training and do not train on synthetic renderings, we perform evaluation on both synthetic and real training data demonstrating superior results before and after refinement when compared to all recent detectors. While being precise, the presented approach is still real-time capable.

Sergey Zakharov, Ivan Shugurov, Slobodan Ilic• 2019

Related benchmarks

TaskDatasetResultRank
6D Pose EstimationYCB-Video--
148
6D Object Pose EstimationLineMOD
Average Accuracy95.15
50
6D Object Pose EstimationOccludedLINEMOD (test)
ADD(S)47.3
45
6-DoF Pose EstimationYCB-V BOP challenge 2020
AR22.2
37
6D Pose EstimationLineMod (test)
Ape87.73
29
6D Object Pose EstimationOcclusion LINEMOD
Average Error47.3
27
Object Pose EstimationLineMod (test)
Average Accuracy95.2
21
6D Pose EstimationHomebrewed BOP challenge (test)
Avg Recall28.6
20
6D Pose EstimationOcclusion dataset BOP challenge (test)
AR16.9
19
Object Pose EstimationLineMod (test)
APE87.7
18
Showing 10 of 14 rows

Other info

Follow for update