Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images

About

We present PyMAF-X, a regression-based approach to recovering parametric full-body models from monocular images. This task is very challenging since minor parametric deviation may lead to noticeable misalignment between the estimated mesh and the input image. Moreover, when integrating part-specific estimations into the full-body model, existing solutions tend to either degrade the alignment or produce unnatural wrist poses. To address these issues, we propose a Pyramidal Mesh Alignment Feedback (PyMAF) loop in our regression network for well-aligned human mesh recovery and extend it as PyMAF-X for the recovery of expressive full-body models. The core idea of PyMAF is to leverage a feature pyramid and rectify the predicted parameters explicitly based on the mesh-image alignment status. Specifically, given the currently predicted parameters, mesh-aligned evidence will be extracted from finer-resolution features accordingly and fed back for parameter rectification. To enhance the alignment perception, an auxiliary dense supervision is employed to provide mesh-image correspondence guidance while spatial alignment attention is introduced to enable the awareness of the global contexts for our network. When extending PyMAF for full-body mesh recovery, an adaptive integration strategy is proposed in PyMAF-X to produce natural wrist poses while maintaining the well-aligned performance of the part-specific estimations. The efficacy of our approach is validated on several benchmark datasets for body, hand, face, and full-body mesh recovery, where PyMAF and PyMAF-X effectively improve the mesh-image alignment and achieve new state-of-the-art results. The project page with code and video results can be found at https://zhanghongwen.cn/pymaf-x.

Hongwen Zhang, Yating Tian, Yuxiang Zhang, Mengcheng Li, Liang An, Zhenan Sun, Yebin Liu• 2022

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationHuman3.6M (test)--
547
3D Human Pose Estimation3DPW (test)
PA-MPJPE45.3
505
3D Human Mesh Recovery3DPW (test)
PA-MPJPE58.9
264
3D Human Mesh RecoveryHuman3.6M (test)
PA-MPJPE40.5
120
3D Human Mesh Recovery3DPW
PA-MPJPE47.1
72
3D Human Pose and Shape EstimationAGORA (test)
NMJE (All)140
41
3D Whole-body Human Pose and Shape EstimationAGORA SMPL-X (test)
NMVE (All)141.2
22
3D Whole-Body Mesh RecoveryAGORA (test)
NMVE141.2
20
Human Pose EstimationCOCO 2014 (val)
AP47.7
18
3D Face ReconstructionNoW
Median Error (mm)1.13
17
Showing 10 of 32 rows

Other info

Code

Follow for update