Compositional Human Pose Regression
About
Regression based methods are not performing as well as detection based methods for human pose estimation. A central problem is that the structural information in the pose is not well exploited in the previous regression methods. In this work, we propose a structure-aware regression approach. It adopts a reparameterized pose representation using bones instead of joints. It exploits the joint connection structure to define a compositional loss function that encodes the long range interactions in the pose. It is simple, effective, and general for both 2D and 3D pose estimation in a unified setting. Comprehensive evaluation validates the effectiveness of our approach. It significantly advances the state-of-the-art on Human3.6M and is competitive with state-of-the-art results on MPII.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | Human3.6M (test) | MPJPE (Average)30.6 | 547 | |
| 3D Human Pose Estimation | Human3.6M (Protocol #1) | MPJPE (Avg.)48.3 | 440 | |
| 3D Human Pose Estimation | Human3.6M (Protocol 2) | Average MPJPE48.3 | 315 | |
| Human Pose Estimation | MPII (test) | Shoulder PCK94.3 | 314 | |
| 3D Human Pose Estimation | Human3.6M Protocol 1 (test) | Dir. Error (Protocol 1)42.1 | 183 | |
| 3D Human Pose Estimation | Human3.6M (subjects 9 and 11) | Average Error59.1 | 180 | |
| 3D Human Pose Estimation | Human3.6M | -- | 160 | |
| 3D Human Pose Estimation | Human3.6M Protocol #2 (test) | Average Error48.3 | 140 | |
| 3D Human Pose Estimation | Human3.6M (S9, S11) | Average Error (MPJPE Avg)59.1 | 94 | |
| 3D Human Pose Estimation | Human3.6M S9 and S11 (test) | Dir. Error39.5 | 72 |