Instant Volumetric Head Avatars

About

We present Instant Volumetric Head Avatars (INSTA), a novel approach for reconstructing photo-realistic digital avatars instantaneously. INSTA models a dynamic neural radiance field based on neural graphics primitives embedded around a parametric face model. Our pipeline is trained on a single monocular RGB portrait video that observes the subject under different expressions and views. While state-of-the-art methods take up to several days to train an avatar, our method can reconstruct a digital avatar in less than 10 minutes on modern GPU hardware, which is orders of magnitude faster than previous solutions. In addition, it allows for the interactive rendering of novel poses and expressions. By leveraging the geometry prior of the underlying parametric face model, we demonstrate that INSTA extrapolates to unseen poses. In quantitative and qualitative studies on various subjects, INSTA outperforms state-of-the-art methods regarding rendering quality and training time.

Wojciech Zielonka, Timo Bolkart, Justus Thies• 2022

Related benchmarks

Task	Dataset	Result
Self-Reenactment	HDTF	PSNR25.03	35
Self-Reenactment	INSTA	PSNR27.85	19
Monocular Reenactment	SplattingAvatar	MSE1.555	10
3D Head Avatar Reconstruction	NHA, NerFace, PointAvatar, INSTA, and custom captures (test)	L1 Error0.016	8
Monocular 3D Head Avatar Creation	NeRSemble	PSNR15.8	8
Head Avatar Rendering	INSTA	Inverse MAE76.4	7
Self-Reenactment	self-captured dataset	PSNR25.91	6
Monocular 3D Head Avatar Reconstruction	INSTA In-the-Wild Data	LPIPS0.048	5
Side-view reconstruction	Marcel ±25° renderings averaged	O_alpha0.0779	5
Facial cross-person reenactment	Facial cross-person reenactment dataset	E_feat_cos0.9087	5

Showing 10 of 17 rows

Other info

Code

Follow for update

@wizwand_team Discord