ReBaR: Reference-Based Reasoning for Robust Pose Estimation from Monocular Images
About
R}easoning for Robust Human Pose and Shape Estimation), designed to estimate human body shape and pose from single-view images. ReBaR effectively addresses the challenges of occlusions and depth ambiguity by learning reference features for part regression reasoning. Our approach starts by extracting features from both body and part regions using an attention-guided mechanism. Subsequently, these features are used to encode additional part-body dependencies for individual part regression, with part features serving as queries and the body feature as a reference. This reference-based reasoning allows our network to infer the spatial relationships of occluded parts with the body, utilizing visible parts and body reference information. ReBaR outperforms contemporary methods on three benchmark datasets and still maintains competitive advantages among recent new approaches. Demonstrating significant improvement in handling depth ambiguity and occlusion. These results strongly support the effectiveness of our reference-based framework for estimating human body shape and pose from single-view images.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Mesh Recovery | 3DPW (test) | MPJPE65.4 | 341 | |
| 3D Human Pose and Shape Estimation | AGORA (test) | NMJE (All)79.9 | 41 | |
| 3D Human Pose and Mesh Recovery | 3DPW | PA-MPJPE45.3 | 40 | |
| 3D Human Pose Estimation | 3DPW OCC (test) | PA-MPJPE48.3 | 31 | |
| 3D Human Pose Estimation | 3DOH (test) | PA-MPJPE55.5 | 20 |