Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Instant Volumetric Head Avatars

About

We present Instant Volumetric Head Avatars (INSTA), a novel approach for reconstructing photo-realistic digital avatars instantaneously. INSTA models a dynamic neural radiance field based on neural graphics primitives embedded around a parametric face model. Our pipeline is trained on a single monocular RGB portrait video that observes the subject under different expressions and views. While state-of-the-art methods take up to several days to train an avatar, our method can reconstruct a digital avatar in less than 10 minutes on modern GPU hardware, which is orders of magnitude faster than previous solutions. In addition, it allows for the interactive rendering of novel poses and expressions. By leveraging the geometry prior of the underlying parametric face model, we demonstrate that INSTA extrapolates to unseen poses. In quantitative and qualitative studies on various subjects, INSTA outperforms state-of-the-art methods regarding rendering quality and training time.

Wojciech Zielonka, Timo Bolkart, Justus Thies• 2022

Related benchmarks

TaskDatasetResultRank
Self-ReenactmentINSTA
PSNR27.85
14
Self-ReenactmentHDTF
PSNR25.03
14
Monocular 3D Head Avatar CreationNeRSemble
PSNR15.8
8
Head Avatar RenderingINSTA
Inverse MAE76.4
7
Self-Reenactmentself-captured dataset
PSNR25.91
6
Facial cross-person reenactmentFacial cross-person reenactment dataset
E_feat_cos0.9087
5
Head Avatar RenderingMonocular video for head avatar
PSNR26.42
5
3D Head Avatar ReconstructionMonocular RGB videos (test)
LPIPS0.149
5
Novel expression and view synthesisNeRSemble (novel expressions and views)
PSNR27.9181
5
Novel View SynthesisNeRSemble (novel-view split)
PSNR27.7786
5
Showing 10 of 13 rows

Other info

Code

Follow for update