Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation

About

In this paper, we propose a pose grammar to tackle the problem of 3D human pose estimation. Our model directly takes 2D pose as input and learns a generalized 2D-3D mapping function. The proposed model consists of a base network which efficiently captures pose-aligned features and a hierarchy of Bi-directional RNNs (BRNN) on the top to explicitly incorporate a set of knowledge regarding human body configuration (i.e., kinematics, symmetry, motor coordination). The proposed model thus enforces high-level constraints over human poses. In learning, we develop a pose sample simulator to augment training samples in virtual camera views, which further improves our model generalizability. We validate our method on public 3D human pose benchmarks and propose a new evaluation protocol working on cross-view setting to verify the generalization capability of different methods. We empirically observe that most state-of-the-art methods encounter difficulty under such setting while our method can well handle such challenges.

Haoshu Fang, Yuanlu Xu, Wenguan Wang, Xiaobai Liu, Song-Chun Zhu• 2017

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationHuman3.6M (test)
MPJPE (Average)45.7
547
3D Human Pose EstimationHuman3.6M (Protocol #1)
MPJPE (Avg.)45.7
440
3D Human Pose EstimationHuman3.6M (Protocol 2)
Average MPJPE45.7
315
3D Human Pose EstimationHuman3.6M Protocol 1 (test)
Dir. Error (Protocol 1)38.2
183
3D Human Pose EstimationHuman3.6M (subjects 9 and 11)--
180
3D Human Pose EstimationHuman3.6M
MPJPE60.4
160
3D Human Pose EstimationHuman3.6M Protocol #2 (test)
Average Error45.7
140
3D Human Pose EstimationHumanEva-I (test)
Walking S1 Error (mm)19.4
85
3D Human Pose EstimationHuman3.6M S9 and S11 (test)
Dir. Error38.2
72
3D Pose EstimationHuman3.6M--
66
Showing 10 of 16 rows

Other info

Follow for update