Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation
About
Direct prediction of 3D body pose and shape remains a challenge even for highly parameterized deep learning models. Mapping from the 2D image space to the prediction space is difficult: perspective ambiguities make the loss function noisy and training data is scarce. In this paper, we propose a novel approach (Neural Body Fitting (NBF)). It integrates a statistical body model within a CNN, leveraging reliable bottom-up semantic body part segmentation and robust top-down body model constraints. NBF is fully differentiable and can be trained using 2D and 3D annotations. In detailed experiments, we analyze how the components of our model affect performance, especially the use of part segmentations as an explicit intermediate representation, and present a robust, efficiently trainable framework for 3D human pose estimation from 2D images with competitive results on standard benchmarks. Code will be made available at http://github.com/mohomran/neural_body_fitting
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | Human3.6M (test) | MPJPE (Average)59.9 | 547 | |
| 3D Human Pose Estimation | 3DPW (test) | PA-MPJPE70.2 | 505 | |
| 3D Human Pose Estimation | Human3.6M (Protocol 2) | Average MPJPE59.9 | 315 | |
| 3D Human Pose Estimation | Human3.6M | -- | 160 | |
| 3D Human Mesh Recovery | Human3.6M (test) | -- | 120 | |
| 3D Human Pose and Shape Estimation | Human3.6M (test) | PA-MPJPE59.9 | 119 | |
| 3D Human Pose and Shape Estimation | Human3.6M | PA-MPJPE59.9 | 36 | |
| 3D Human Mesh Recovery | Human3.6M (Protocol 2) | Reconstruction Error59.9 | 29 | |
| 3D Human Pose Estimation | HumanEva I | Mean Error64 | 10 | |
| 3D Human Pose and Shape Estimation | 3DPW 2018 (test) | PA-MPJPE90.7 | 10 |