Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization

About

Current works on multi-person 3D pose estimation mainly focus on the estimation of the 3D joint locations relative to the root joint and ignore the absolute locations of each pose. In this paper, we propose the Human Depth Estimation Network (HDNet), an end-to-end framework for absolute root joint localization in the camera coordinate space. Our HDNet first estimates the 2D human pose with heatmaps of the joints. These estimated heatmaps serve as attention masks for pooling features from image regions corresponding to the target person. A skeleton-based Graph Neural Network (GNN) is utilized to propagate features among joints. We formulate the target depth regression as a bin index estimation problem, which can be transformed with a soft-argmax operation from the classification output of our HDNet. We evaluate our HDNet on the root joint localization and root-relative 3D pose estimation tasks with two benchmark datasets, i.e., Human3.6M and MuPoTS-3D. The experimental results show that we outperform the previous state-of-the-art consistently under multiple evaluation metrics. Our source code is available at: https://github.com/jiahaoLjh/HumanDepth.

Jiahao Lin, Gim Hee Lee• 2020

Related benchmarks

TaskDatasetResultRank
Multi-person 3D Pose EstimationMuPoTS-3D (test)
3DPCK83.7
41
3D Multi-person Pose EstimationMuPoTS-3D Matched people
PCKrel83.7
22
3D Multi-person Pose EstimationMuPoTS-3D
3D PCK Score83.7
21
3D Human Pose EstimationCMU Panoptic (test)--
15
Root-relative 3D pose estimationMuPoTS-3D (test)
S1 Score94.4
7
3D Human Pose EstimationMuPoTS synthOCC (Matched people)
PCK Relative72.3
6
Root joint localizationHuman3.6M (test)
MRPE77.6
5
Root joint localizationMuPoTS-3D (test)
AP @ 25mm39.4
2
Showing 8 of 8 rows

Other info

Code

Follow for update