FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding

About

We propose FlashAvatar, a novel and lightweight 3D animatable avatar representation that could reconstruct a digital avatar from a short monocular video sequence in minutes and render high-fidelity photo-realistic images at 300FPS on a consumer-grade GPU. To achieve this, we maintain a uniform 3D Gaussian field embedded in the surface of a parametric face model and learn extra spatial offset to model non-surface regions and subtle facial details. While full use of geometric priors can capture high-frequency facial details and preserve exaggerated expressions, proper initialization can help reduce the number of Gaussians, thus enabling super-fast rendering speed. Extensive experimental results demonstrate that FlashAvatar outperforms existing works regarding visual quality and personalized details and is almost an order of magnitude faster in rendering speed. Project page: https://ustc3dv.github.io/FlashAvatar/

Jun Xiang, Xuan Gao, Yudong Guo, Juyong Zhang• 2023

Related benchmarks

Task	Dataset	Result
Novel Expression Synthesis	NeRSemble	PSNR16.94	41
Self-Reenactment	HDTF	PSNR27.58	35
Novel View Synthesis	NeRSemble	SSIM78.5	24
3D Head Reconstruction	NeRSemble (test)	PSNR18.1	20
Self-Reenactment	INSTA	PSNR29.13	19
Head Avatar Reconstruction	INSTA Dataset	PSNR27.9	14
Self-Reenactment	NeRSemble Novel Expression frontal	PSNR21.24	13
Self-Reenactment	NeRSemble Novel Expression (all view)	PSNR16.95	13
Monocular Reenactment	SplattingAvatar	MSE1.173	10
Head Avatar Reconstruction	INSTA dataset (test)	PSNR (bala)31.98	8

Showing 10 of 39 rows

Other info

Code

Follow for update

@wizwand_team Discord