Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing
About
We present PHORHUM, a novel, end-to-end trainable, deep neural network methodology for photorealistic 3D human reconstruction given just a monocular RGB image. Our pixel-aligned method estimates detailed 3D geometry and, for the first time, the unshaded surface color together with the scene illumination. Observing that 3D supervision alone is not sufficient for high fidelity color reconstruction, we introduce patch-based rendering losses that enable reliable color reconstruction on visible parts of the human, and detailed and plausible color estimation for the non-visible parts. Moreover, our method specifically addresses methodological and practical limitations of prior work in terms of representing geometry, albedo, and illumination effects, in an end-to-end model where factors can be effectively disentangled. In extensive experiments, we demonstrate the versatility and robustness of our approach. Our state-of-the-art results validate the method qualitatively and for different metrics, for both geometric and color reconstruction.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Human Reconstruction (Normals Front) | Monocular 3D Human Reconstruction (test) | Angular Error18.41 | 15 | |
| 3D human reconstruction | Monocular 3D Human Reconstruction (test) | Ch. Distance1.1 | 15 | |
| 3D Human Reconstruction (Normals Back) | Monocular 3D Human Reconstruction (test) | Angular Error22.82 | 15 | |
| Monocular 3D human reconstruction | RenderPeople | Chamfer Distance2.92 | 13 | |
| 3D Human Reconstruction (Shaded Front) | Monocular 3D Human Reconstruction (test) | SSIM0.86 | 9 | |
| 3D Human Reconstruction (Albedo Back) | Monocular 3D Human Reconstruction (test) | SSIM0.76 | 6 | |
| 3D Human Reconstruction (Albedo Front) | Monocular 3D Human Reconstruction (test) | SSIM0.85 | 6 | |
| 3D human reconstruction | User Study RGB Textures (30 users ranking) | Front Texture Score3.45 | 4 |