Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

6-DoF Object Pose from Semantic Keypoints

About

This paper presents a novel approach to estimating the continuous six degree of freedom (6-DoF) pose (3D translation and rotation) of an object from a single RGB image. The approach combines semantic keypoints predicted by a convolutional network (convnet) with a deformable shape model. Unlike prior work, we are agnostic to whether the object is textured or textureless, as the convnet learns the optimal representation from the available training image data. Furthermore, the approach can be applied to instance- and class-based pose recovery. Empirically, we show that the proposed approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios with a cluttered background. For class-based object pose estimation, state-of-the-art accuracy is shown on the large-scale PASCAL3D+ dataset.

Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis• 2017

Related benchmarks

TaskDatasetResultRank
Viewpoint EstimationPASCAL3D+
Aero Error Rate8
20
2D Keypoint LocalizationPASCAL3D+ (test)
Aero Acc92.3
6
Viewpoint EstimationPascal3D+ v1.0 (test)
Aeroplane Error8
5
6-DoF Pose EstimationGas canister dataset (test)
Mean Rotation Error (deg)3.57
3
Showing 4 of 4 rows

Other info

Code

Follow for update