Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking

About

Monocular 3D human-pose estimation from static images is a challenging problem, due to the curse of dimensionality and the ill-posed nature of lifting 2D-to-3D. In this paper, we propose a Deep Conditional Variational Autoencoder based model that synthesizes diverse anatomically plausible 3D-pose samples conditioned on the estimated 2D-pose. We show that CVAE-based 3D-pose sample set is consistent with the 2D-pose and helps tackling the inherent ambiguity in 2D-to-3D lifting. We propose two strategies for obtaining the final 3D pose- (a) depth-ordering/ordinal relations to score and weight-average the candidate 3D-poses, referred to as OrdinalScore, and (b) with supervision from an Oracle. We report close to state of-the-art results on two benchmark datasets using OrdinalScore, and state-of-the-art results using the Oracle. We also show that our pipeline yields competitive results without paired image-to-3D annotations. The training and evaluation code is available at https://github.com/ssfootball04/generative_pose.

Saurabh Sharma, Pavan Teja Varigonda, Prashast Bindal, Abhishek Sharma, Arjun Jain• 2019

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)32.7
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)46.8
440
3D Human Pose EstimationHuman3.6M (Protocol 2)
Average MPJPE32.7
315
3D Human Pose EstimationHuman3.6M Protocol 1 (test)
Dir. Error (Protocol 1)48.6
183
3D Human Pose EstimationHuman3.6M
MPJPE58
160
3D Human Pose EstimationHuman3.6M S9 and S11 (test)--
72
3D Pose EstimationHuman3.6M--
66
3D Human Pose EstimationHuman3.6M v1 (test)
Avg Performance53.4
58
3D Human Pose EstimationHuman3.6M Standard Protocol
MPJPE46.8
19
3D Human Pose EstimationHumanEva-I (Walking)
S1 Error38.5
18
Showing 10 of 17 rows

Other info

Code

Follow for update