Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction
About
We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building-block for a variety of applications. Especially, for telepresence applications in AR or VR, a faithful reproduction of the appearance including novel viewpoints or head-poses is required. In contrast to state-of-the-art approaches that model the geometry and material properties explicitly, or are purely image-based, we introduce an implicit representation of the head based on scene representation networks. To handle the dynamics of the face, we combine our scene representation network with a low-dimensional morphable model which provides explicit control over pose and expressions. We use volumetric rendering to generate images from this hybrid representation and demonstrate that such a dynamic neural scene representation can be learned from monocular input data only, without the need of a specialized capture setup. In our experiments, we show that this learned volumetric representation allows for photo-realistic image generation that surpasses the quality of state-of-the-art video-based reenactment methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Head Avatar Rendering | Authors' Monocular RGB Video Dataset Subject 2 (test) | LPIPS0.188 | 6 | |
| Head Avatar Rendering | NerFACE Subject 4 (test) | LPIPS0.093 | 6 | |
| Head Avatar Rendering | Authors' Monocular RGB Video Dataset Subject 1 (test) | LPIPS0.182 | 6 | |
| Head Avatar Rendering | Authors' Monocular RGB Video Dataset Subject 0 (test) | LPIPS0.205 | 6 | |
| 3D-aware face reconstruction | NerFACE (test) | PSNR24.092 | 6 | |
| Head Avatar Rendering | Authors' Monocular RGB Video Dataset Subject 3 (test) | LPIPS0.229 | 6 | |
| Head Avatar Reconstruction | 19 Video Sequences NHA, IMAvatar, and NeRFace | L2 Distance0.0018 | 4 | |
| Portrait Reanimation | Captured Video Sequence Subject 4 (test) | PSNR28.47 | 4 | |
| Portrait Reanimation | Captured Video Sequence Subject 2 (test) | PSNR24.57 | 4 | |
| Portrait Reanimation | Captured Video Sequence Subject 3 (test) | PSNR28 | 4 |