Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

About

We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search for better 2D locations. The entire process is trained end-to-end, is extremely efficient and obtains state- of-the-art results on Human3.6M outperforming previous approaches both on 2D and 3D errors.

Denis Tome, Chris Russell, Lourdes Agapito• 2017

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)88.39
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)70.7
440
3D Human Pose EstimationHuman3.6M (Protocol 2)
Average MPJPE70.7
315
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)
Average Error88.4
180
3D Human Pose EstimationHuman3.6M S9 and S11 (test)
Dir. Error65
72
3D Human Pose EstimationHuman3.6M Protocol-III (test)
Discussion Error (MPJPE)73.5
20
3D Human Pose EstimationH36M (all subjects)
MPJPE88.4
13
3D Human Pose EstimationH3.6M (val)--
8
3D Human Pose ReconstructionPoseKernel 1.0 (minor participants)
Head Error57.49
5
3D Human Pose ReconstructionPoseKernel 1.0 (adult participants)
Head Error34.77
5
Showing 10 of 13 rows

Other info

Follow for update