Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Graph Stacked Hourglass Networks for 3D Human Pose Estimation

About

In this paper, we propose a novel graph convolutional network architecture, Graph Stacked Hourglass Networks, for 2D-to-3D human pose estimation tasks. The proposed architecture consists of repeated encoder-decoder, in which graph-structured features are processed across three different scales of human skeletal representations. This multi-scale architecture enables the model to learn both local and global feature representations, which are critical for 3D human pose estimation. We also introduce a multi-level feature learning approach using different-depth intermediate features and show the performance improvements that result from exploiting multi-scale, multi-level feature representations. Extensive experiments are conducted to validate our approach, and the results show that our model outperforms the state-of-the-art.

Tianhan Xu, Wataru Takano• 2021

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)
PCK80.1
584
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)35.8
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)35.8
440
3D Human Pose EstimationHuman3.6M
MPJPE51.9
184
3D Human Pose EstimationHuman3.6M Protocol 1 (test)
Dir. Error (Protocol 1)35.8
183
3D Human Pose EstimationHuman3.6M (S9, S11)
Average Error (MPJPE Avg)51.9
94
3D Human Pose EstimationHuman3.6M S9 and S11 (test)
Dir. Error35.8
72
3D Pose EstimationHuman3.6M--
66
3D Human Pose EstimationHuman3.6M v1 (test)
Avg Performance35.8
58
3D Human Pose EstimationHuman3.6M GT 2D pose sequences (test)
MPJPE (Dire.)35.8
29
Showing 10 of 11 rows

Other info

Follow for update