Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning 3D Human Pose from Structure and Motion

About

3D human pose estimation from a single image is a challenging problem, especially for in-the-wild settings due to the lack of 3D annotated data. We propose two anatomically inspired loss functions and use them with a weakly-supervised learning framework to jointly learn from large-scale in-the-wild 2D and indoor/synthetic 3D data. We also present a simple temporal network that exploits temporal and structural cues present in predicted pose sequences to temporally harmonize the pose estimations. We carefully analyze the proposed contributions through loss surface visualizations and sensitivity analysis to facilitate deeper understanding of their working mechanism. Our complete pipeline improves the state-of-the-art by 11.8% and 12% on Human3.6M and MPI-INF-3DHP, respectively, and runs at 30 FPS on a commodity graphics card.

Rishabh Dabral, Anurag Mundhada, Uday Kusupati, Safeer Afaque, Abhishek Sharma, Arjun Jain• 2017

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)
PCK76.7
584
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)36.3
547
3D Human Pose Estimation3DPW (test)
PA-MPJPE92.3
514
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)52.1
440
3D Human Pose EstimationHuman3.6M (Protocol 2)
Average MPJPE36.3
315
3D Human Pose EstimationHuman3.6M--
184
3D Human Pose EstimationHuman3.6M Protocol 1 (test)
Dir. Error (Protocol 1)44.8
183
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)
Average Error52.1
180
3D Human Pose EstimationHuman3.6M Protocol #2 (test)
Average Error36.3
140
3D Human Pose EstimationMPI-INF-3DHP
PCK76.7
114
Showing 10 of 15 rows

Other info

Follow for update