Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction

About

We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building-block for a variety of applications. Especially, for telepresence applications in AR or VR, a faithful reproduction of the appearance including novel viewpoints or head-poses is required. In contrast to state-of-the-art approaches that model the geometry and material properties explicitly, or are purely image-based, we introduce an implicit representation of the head based on scene representation networks. To handle the dynamics of the face, we combine our scene representation network with a low-dimensional morphable model which provides explicit control over pose and expressions. We use volumetric rendering to generate images from this hybrid representation and demonstrate that such a dynamic neural scene representation can be learned from monocular input data only, without the need of a specialized capture setup. In our experiments, we show that this learned volumetric representation allows for photo-realistic image generation that surpasses the quality of state-of-the-art video-based reenactment methods.

Guy Gafni, Justus Thies, Michael Zollh\"ofer, Matthias Nie{\ss}ner• 2020

Related benchmarks

Task	Dataset	Result
Head Avatar Rendering	Authors' Monocular RGB Video Dataset Subject 2 (test)	LPIPS0.188	6
Head Avatar Rendering	NerFACE Subject 4 (test)	LPIPS0.093	6
Head Avatar Rendering	Authors' Monocular RGB Video Dataset Subject 1 (test)	LPIPS0.182	6
Head Avatar Rendering	Authors' Monocular RGB Video Dataset Subject 0 (test)	LPIPS0.205	6
3D-aware face reconstruction	NerFACE (test)	PSNR24.092	6
Head Avatar Rendering	Authors' Monocular RGB Video Dataset Subject 3 (test)	LPIPS0.229	6
Facial Avatar Reconstruction	Real Human Face Videos (test)	PSNR26.77	5
Head Avatar Reconstruction	19 Video Sequences NHA, IMAvatar, and NeRFace	L2 Distance0.0018	4
Portrait Reanimation	Captured Video Sequence Subject 4 (test)	PSNR28.47	4
Portrait Reanimation	Captured Video Sequence Subject 2 (test)	PSNR24.57	4

Showing 10 of 17 rows

Other info

Follow for update

@wizwand_team Discord