Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering

About

We present Gaussian Pixel Codec Avatars (GPiCA), photorealistic head avatars that can be generated from multi-view images and efficiently rendered on mobile devices. GPiCA utilizes a unique hybrid representation that combines a triangle mesh and anisotropic 3D Gaussians. This combination maximizes memory and rendering efficiency while maintaining a photorealistic appearance. The triangle mesh is highly efficient in representing surface areas like facial skin, while the 3D Gaussians effectively handle non-surface areas such as hair and beard. To this end, we develop a unified differentiable rendering pipeline that treats the mesh as a semi-transparent layer within the volumetric rendering paradigm of 3D Gaussian Splatting. We train neural networks to decode a facial expression code into three components: a 3D face mesh, an RGBA texture, and a set of 3D Gaussians. These components are rendered simultaneously in a unified rendering engine. The networks are trained using multi-view image supervision. Our results demonstrate that GPiCA achieves the realism of purely Gaussian-based avatars while matching the rendering performance of mesh-based avatars.

Divam Gupta, Anuj Pahuja, Nemanja Bartolovic, Tomas Simon, Forrest Iandola, Giljoo Nam• 2025

Related benchmarks

TaskDatasetResultRank
3D Avatar RenderingQuest 3 Mobile Benchmark (test)
LPIPS0.33
4
Head Avatar ReconstructionFace Dataset (Subject 1)
MAE7.65
4
Head Avatar ReconstructionFace Dataset (Subject 2)
MAE5.7
4
Head Avatar ReconstructionFace Dataset (Subject 3)
MAE6.13
4
Head Avatar ReconstructionFace Dataset (Subject 4)
MAE6.04
4
Head Avatar ReconstructionFace Dataset (Subject 5)
MAE6.21
4
Avatar Reconstructionfull body dataset
MAE2.85
4
Showing 7 of 7 rows

Other info

Follow for update