Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CameraHMR: Aligning People with Perspective

About

We address the challenge of accurate 3D human pose and shape estimation from monocular images. The key to accuracy and robustness lies in high-quality training data. Existing training datasets containing real images with pseudo ground truth (pGT) use SMPLify to fit SMPL to sparse 2D joint locations, assuming a simplified camera with default intrinsics. We make two contributions that improve pGT accuracy. First, to estimate camera intrinsics, we develop a field-of-view prediction model (HumanFoV) trained on a dataset of images containing people. We use the estimated intrinsics to enhance the 4D-Humans dataset by incorporating a full perspective camera model during SMPLify fitting. Second, 2D joints provide limited constraints on 3D body shape, resulting in average-looking bodies. To address this, we use the BEDLAM dataset to train a dense surface keypoint detector. We apply this detector to the 4D-Humans dataset and modify SMPLify to fit the detected keypoints, resulting in significantly more realistic body shapes. Finally, we upgrade the HMR2.0 architecture to include the estimated camera parameters. We iterate model training and SMPLify fitting initialized with the previously trained model. This leads to more accurate pGT and a new model, CameraHMR, with state-of-the-art accuracy. Code and pGT are available for research purposes.

Priyanka Patel, Michael J. Black• 2024

Related benchmarks

TaskDatasetResultRank
3D Human Mesh Recovery3DPW (test)
PA-MPJPE35.1
264
Human Mesh Reconstruction3DPW 14 joints (test)
PA-MPJPE35.1
26
Human Mesh ReconstructionEMDB 24 joints (test)
PA-MPJPE43.3
21
Human Mesh Recovery3DPW 14 (test)
PA-MPJPE35.1
10
Human Mesh ReconstructionRICH 24 joints (test)
PA-MPJPE34
8
Human Mesh RecoveryRICH 24 (test)
PA-MPJPE34
8
Human Mesh RecoveryCOCO
Success Rate @ Error 0.0580.5
8
3D Human Motion GenerationCoMoVi Dataset (test)
FID0.815
7
3D Human Motion GenerationMotion-X++ (test)
FID21.538
7
Human Mesh RecoveryLSPET
PCK@0.0549.1
6
Showing 10 of 16 rows

Other info

Follow for update