Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

About

3D human shape and pose estimation is the essential task for human motion analysis, which is widely used in many 3D applications. However, existing methods cannot simultaneously capture the relations at multiple levels, including spatial-temporal level and human joint level. Therefore they fail to make accurate predictions in some hard scenarios when there is cluttered background, occlusion, or extreme pose. To this end, we propose Multi-level Attention Encoder-Decoder Network (MAED), including a Spatial-Temporal Encoder (STE) and a Kinematic Topology Decoder (KTD) to model multi-level attentions in a unified framework. STE consists of a series of cascaded blocks based on Multi-Head Self-Attention, and each block uses two parallel branches to learn spatial and temporal attention respectively. Meanwhile, KTD aims at modeling the joint level attention. It regards pose estimation as a top-down hierarchical process similar to SMPL kinematic tree. With the training set of 3DPW, MAED outperforms previous state-of-the-art methods by 6.2, 7.2, and 2.4 mm of PA-MPJPE on the three widely used benchmarks 3DPW, MPI-INF-3DHP, and Human3.6M respectively. Our code is available at https://github.com/ziniuwan/maed.

Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li• 2021

Related benchmarks

TaskDatasetResultRank
3D Human Pose Estimation3DPW (test)
PA-MPJPE45.7
505
3D Human Mesh Recovery3DPW (test)
PA-MPJPE45.7
264
3D Human Pose EstimationHuman3.6M
MPJPE56.4
160
3D Human Pose and Shape Estimation3DPW (test)
MPJPE-PA45.7
158
Human Mesh Recovery3DPW
PA-MPJPE45.7
123
3D Human Mesh RecoveryHuman3.6M (test)
PA-MPJPE38.7
120
3D Human Pose and Shape EstimationHuman3.6M (test)
PA-MPJPE38.7
119
3D Human Pose and Shape Estimation3DPW
PA-MPJPE45.7
74
3D Human Mesh Recovery3DPW
PA-MPJPE45.7
72
Human Mesh ReconstructionHuman3.6M
PA-MPJPE38.7
50
Showing 10 of 20 rows

Other info

Code

Follow for update