Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery

About

We introduce a novel method for human shape and pose recovery that can fully leverage multiple static views. We target fixed-multiview people monitoring, including elderly care and safety monitoring, in which calibrated cameras can be installed at the corners of a room or an open space but whose configuration may vary depending on the environment. Our key idea is to formulate it as neural optimization. We achieve this with HeatFormer, a neural optimizer that iteratively refines the SMPL parameters given multiview images, which is fundamentally agonistic to the configuration of views. HeatFormer realizes this SMPL parameter estimation as heat map generation and alignment with a novel transformer encoder and decoder. We demonstrate the effectiveness of HeatFormer including its accuracy, robustness to occlusion, and generalizability through an extensive set of experiments. We believe HeatFormer can serve a key role in passive human behavior modeling.

Yuto Matsubara, Ko Nishino• 2024

Related benchmarks

TaskDatasetResultRank
3D Human Pose EstimationMPI-INF-3DHP (test)
PCK99.5
559
3D Human Pose EstimationHuman3.6M
MPJPE28.6
160
Human Mesh RecoveryMPI-INF-3DHP
MPJPE59.8
35
Human Mesh RecoveryBEHAVE (Protocol 1)
MPJPE48.9
8
Human Mesh RecoveryBEHAVE (Protocol 2)
MPJPE32.6
8
Human Mesh RecoveryRICH
PA-MPJPE44.9
7
Human Mesh RecoveryBEHAVE
PA-MPJPE33.8
7
Human Mesh RecoveryMoYo
PA-MPJPE85.7
7
Human Mesh Recovery4D-DRESS
PA-MPJPE43.8
7
Human Mesh Recovery4D-DRESS partial
PA-MPJPE140.1
7
Showing 10 of 10 rows

Other info

Code

Follow for update