Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot

About

We present Multi-HMR, a strong sigle-shot model for multi-person 3D human mesh recovery from a single RGB image. Predictions encompass the whole body, i.e., including hands and facial expressions, using the SMPL-X parametric model and 3D location in the camera coordinate system. Our model detects people by predicting coarse 2D heatmaps of person locations, using features produced by a standard Vision Transformer (ViT) backbone. It then predicts their whole-body pose, shape and 3D location using a new cross-attention module called the Human Prediction Head (HPH), with one query attending to the entire set of features for each detected person. As direct prediction of fine-grained hands and facial poses in a single shot, i.e., without relying on explicit crops around body parts, is hard to learn from existing data, we introduce CUFFS, the Close-Up Frames of Full-Body Subjects dataset, containing humans close to the camera with diverse hand poses. We show that incorporating it into the training data further enhances predictions, particularly for hands. Multi-HMR also optionally accounts for camera intrinsics, if available, by encoding camera ray directions for each image token. This simple design achieves strong performance on whole-body and body-only benchmarks simultaneously: a ViT-S backbone on $448{\times}448$ images already yields a fast and competitive model, while larger models and higher resolutions obtain state-of-the-art results.

Fabien Baradel, Matthieu Armando, Salma Galaaoui, Romain Br\'egier, Philippe Weinzaepfel, Gr\'egory Rogez, Thomas Lucas• 2024

Related benchmarks

TaskDatasetResultRank
3D Human Pose Estimation3DPW (test)
PA-MPJPE41.7
505
3D Human Pose Estimation3DPW
PA-MPJPE45.9
119
3D Human Mesh Recovery3DPW
PA-MPJPE41.7
72
Human Mesh Reconstruction3DPW 14 joints (test)
PA-MPJPE45.9
26
3D Whole-body Human Pose and Shape EstimationAGORA SMPL-X (test)
NMVE (All)102
22
3D Human Pose Estimation3DPW 14 joints (test)
MPJPE61.4
21
Human Mesh ReconstructionEMDB 24 joints (test)
PA-MPJPE50.1
21
3D Human Mesh EstimationMuPoTS
3DPCK (All)85
14
Human Mesh RecoverySelf-contact dataset (test)
PA-V2V57.46
12
Whole-body Mesh RecoveryEHF
PVEJ (All)42
11
Showing 10 of 30 rows

Other info

Code

Follow for update