Body Meshes as Points
About
We consider the challenging multi-person 3D body mesh estimation task in this work. Existing methods are mostly two-stage based--one stage for person localization and the other stage for individual body mesh estimation, leading to redundant pipelines with high computation cost and degraded performance for complex scenes (e.g., occluded person instances). In this work, we present a single-stage model, Body Meshes as Points (BMP), to simplify the pipeline and lift both efficiency and performance. In particular, BMP adopts a new method that represents multiple person instances as points in the spatial-depth space where each point is associated with one body mesh. Hinging on such representations, BMP can directly predict body meshes for multiple persons in a single stage by concurrently localizing person instance points and estimating the corresponding body meshes. To better reason about depth ordering of all the persons within the same scene, BMP designs a simple yet effective inter-instance ordinal depth loss to obtain depth-coherent body mesh estimation. BMP also introduces a novel keypoint-aware augmentation to enhance model robustness to occluded person instances. Comprehensive experiments on benchmarks Panoptic, MuPoTS-3D and 3DPW clearly demonstrate the state-of-the-art efficiency of BMP for multi-person body mesh estimation, together with outstanding accuracy. Code can be found at: https://github.com/jfzhang95/BMP.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Pose Estimation | 3DPW (test) | PA-MPJPE63.8 | 505 | |
| 3D Human Mesh Recovery | 3DPW (test) | PA-MPJPE63.8 | 264 | |
| 3D Human Pose and Shape Estimation | 3DPW | PA-MPJPE63.8 | 74 | |
| 3D Body Mesh Recovery | Human3.6M | PA-MPJPE51.3 | 46 | |
| Multi-person 3D Pose Estimation | MuPoTS-3D (test) | 3DPCK73.83 | 41 | |
| 3D Human Pose and Mesh Recovery | Human3.6M | PA-MPJPE51.3 | 40 | |
| 3D Pose Estimation | MuPoTS-3D Matched (test) | Total Average Score75.3 | 23 | |
| 3D Human Pose Estimation | CMU Panoptic | Mean MPJPE135.4 | 18 | |
| 3D Human Pose Estimation | CMU Panoptic 18 | Haggling MPJPE120.4 | 14 | |
| 3D Human Pose Estimation | MuPoTS-3D all groundtruths (test) | Avg Score73.8 | 9 |