ARAH: Animatable Volume Rendering of Articulated Human SDFs
About
Combining human body models with differentiable rendering has recently enabled animatable avatars of clothed humans from sparse sets of multi-view RGB videos. While state-of-the-art approaches achieve realistic appearance with neural radiance fields (NeRF), the inferred geometry often lacks detail due to missing geometric constraints. Further, animating avatars in out-of-distribution poses is not yet possible because the mapping from observation space to canonical space does not generalize faithfully to unseen poses. In this work, we address these shortcomings and propose a model to create animatable clothed human avatars with detailed geometry that generalize well to out-of-distribution poses. To achieve detailed geometry, we combine an articulated implicit surface representation with volume rendering. For generalization, we propose a novel joint root-finding algorithm for simultaneous ray-surface intersection search and correspondence search. Our algorithm enables efficient point sampling and accurate point canonicalization while generalizing well to unseen poses. We demonstrate that our proposed pipeline can generate clothed avatars with high-quality pose-dependent geometry and appearance from a sparse set of multi-view RGB videos. Our method achieves state-of-the-art performance on geometry and appearance reconstruction while creating animatable avatars that generalize well to out-of-distribution poses beyond the small number of training poses.
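The pairing of an implicit surface (SDF) with volume rendering mentioned above can be illustrated with a minimal single-ray sketch. It is not the authors' implementation: it omits the joint root-finding and canonicalization steps entirely, and the Laplace-CDF mapping from signed distance to density is an assumption borrowed from VolSDF-style formulations. The names `sdf_to_density`, `render_ray`, and the parameter `beta` are hypothetical.

```python
import numpy as np

def sdf_to_density(sdf, beta=0.05):
    """Map signed distance to volume density with a Laplace CDF.

    Assumption: a VolSDF-style mapping; `beta` controls how sharply
    density rises around the zero level set (positive SDF = outside).
    """
    s = -np.asarray(sdf, dtype=float)
    # Laplace CDF with zero mean and scale beta, written in an overflow-safe form.
    cdf = 0.5 + 0.5 * np.sign(s) * (1.0 - np.exp(-np.abs(s) / beta))
    return cdf / beta

def render_ray(sdf_vals, colors, t_vals, beta=0.05):
    """Alpha-composite per-sample colors along one ray.

    sdf_vals: (N,) signed distances at the samples
    colors:   (N, 3) RGB values at the samples
    t_vals:   (N,) increasing ray depths of the samples
    Returns the composited RGB and the per-interval weights (N-1,).
    """
    deltas = np.diff(t_vals)                       # interval lengths
    density = sdf_to_density(sdf_vals, beta)[:-1]  # left endpoint of each interval
    alpha = 1.0 - np.exp(-density * deltas)        # opacity of each interval
    # Transmittance: probability the ray reaches each interval unoccluded.
    trans = np.cumprod(np.concatenate([[1.0], 1.0 - alpha[:-1]]))
    weights = trans * alpha
    rgb = (weights[:, None] * np.asarray(colors)[:-1]).sum(axis=0)
    return rgb, weights

# Toy scene: unit sphere at the origin, camera at z = -3 looking along +z.
t_vals = np.linspace(0.0, 6.0, 256)
sdf_vals = np.abs(-3.0 + t_vals) - 1.0             # exact SDF along this ray
colors = np.tile([1.0, 0.0, 0.0], (t_vals.size, 1))
rgb, weights = render_ray(sdf_vals, colors, t_vals)
depth = (weights * t_vals[:-1]).sum()              # expected hit depth, near t = 2
```

Because the weights concentrate around the zero level set, the rendered color and expected depth are dominated by the surface, which is what lets image-space losses constrain geometry.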
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Novel View Synthesis | ZJU-MoCap | PSNR | 28.51 | 23 |
| Novel View Synthesis | ZJU-MoCap novel view setting | PSNR | 28.4 | 14 |
| Novel Pose Synthesis | ZJU-MoCap (Novel Pose) | PSNR | 24.5 | 10 |
| Novel Pose Synthesis | ZJU 512x512 | PSNR | 24.63 | 9 |
| Novel Pose Synthesis | Thuman4 (novel poses) | PSNR | 21.77 | 6 |
| Novel View Synthesis | Thuman4 (train poses) | PSNR | 22.02 | 6 |
| Novel Pose Synthesis | DeepCap and DynaCap | PSNR | 17.8 | 5 |
| Novel View Synthesis | DeepCap and DynaCap | PSNR | 19.5 | 5 |
| 3D Geometry Reconstruction | DynaCap (subject D2) | Chamfer Distance | 12.985 | 4 |
| Novel View Synthesis | ZJU-MoCap S377 (train) | LPIPS | 9.6 | 2 |
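
Most rows above report PSNR, where higher is better (LPIPS and Chamfer distance are lower-is-better). For reference, the standard PSNR definition can be sketched as follows. The per-benchmark evaluation protocol (masking, cropping, image value range) is not stated here, so the `max_val=1.0` default and the function name `psnr` are assumptions for illustration only.

```python
import numpy as np

def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB between two images of equal shape.

    Assumes pixel values lie in [0, max_val].
    """
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    mse = np.mean((pred - target) ** 2)
    if mse == 0.0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_val ** 2 / mse)

# Example: a uniform per-pixel error of 0.1 gives an MSE of 0.01.
target = np.zeros((4, 4, 3))
pred = np.full((4, 4, 3), 0.1)
score = psnr(pred, target)
```

Since PSNR is logarithmic in the mean squared error, a gap of about 3 dB between two methods corresponds to roughly a factor of two in MSE.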