Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-Person 3D Human Pose Estimation from Monocular Images

About

Multi-person 3D human pose estimation from a single image is a challenging problem, especially for in-the-wild settings due to the lack of 3D annotated data. We propose HG-RCNN, a Mask-RCNN based network that also leverages the benefits of the Hourglass architecture for multi-person 3D Human Pose Estimation. A two-staged approach is presented that first estimates the 2D keypoints in every Region of Interest (RoI) and then lifts the estimated keypoints to 3D. Finally, the estimated 3D poses are placed in camera-coordinates using weak-perspective projection assumption and joint optimization of focal length and root translations. The result is a simple and modular network for multi-person 3D human pose estimation that does not require any multi-person 3D pose dataset. Despite its simple formulation, HG-RCNN achieves the state-of-the-art results on MuPoTS-3D while also approximating the 3D pose in the camera-coordinate system.

Rishabh Dabral, Nitesh B Gundavarapu, Rahul Mitra, Abhishek Sharma, Ganesh Ramakrishnan, Arjun Jain• 2019

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)
AP55.36
2454
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)
Average Error65.2
180
3D Human Pose EstimationHuman3.6M--
160
Multi-person 3D Pose EstimationMuPoTS-3D (test)
3DPCK71.3
41
Keypoint DetectionMS-COCO 2017 (val)
AP63.48
40
3D Multi-person Pose EstimationMuPoTS-3D All people--
24
3D Pose EstimationMuPoTS-3D Matched (test)
Total Average Score74.2
23
3D Multi-person Pose EstimationMuPoTS-3D Matched people
PCKrel74.2
22
3D Multi-person Pose EstimationMuPoTS-3D--
21
3D Pose EstimationMuPoTS-3D 2018 (All annotated poses)
TS185.1
7
Showing 10 of 12 rows

Other info

Follow for update