CGS-GAN: 3D Consistent Gaussian Splatting GANs for High Resolution Human Head Synthesis

About

Recently, 3D GANs based on 3D Gaussian splatting have been proposed for high quality synthesis of human heads. However, existing methods stabilize training and enhance rendering quality from steep viewpoints by conditioning the random latent vector on the current camera position. This compromises 3D consistency, as we observe significant identity changes when re-synthesizing the 3D head with each camera shift. Conversely, fixing the camera to a single viewpoint yields high-quality renderings for that perspective but results in poor performance for novel views. Removing view-conditioning typically destabilizes GAN training, often causing the training to collapse. In response to these challenges, we introduce CGS-GAN, a novel 3D Gaussian Splatting GAN framework that enables stable training and high-quality 3D-consistent synthesis of human heads without relying on view-conditioning. To ensure training stability, we introduce a multi-view regularization technique that enhances generator convergence with minimal computational overhead. Additionally, we adapt the conditional loss used in existing 3D Gaussian splatting GANs and propose a generator architecture designed to not only stabilize training but also facilitate efficient rendering and straightforward scaling, enabling output resolutions up to $2048^2$. To evaluate the capabilities of CGS-GAN, we curate a new dataset derived from FFHQ. This dataset enables very high resolutions, focuses on larger portions of the human head, reduces view-dependent artifacts for improved 3D consistency, and excludes images where subjects are obscured by hands or other objects. As a result, our approach achieves very high rendering quality, supported by competitive FID scores, while ensuring consistent 3D scene generation. Check our our project page here: https://fraunhoferhhi.github.io/cgs-gan/

Florian Barthel, Wieland Morgenstern, Paul Hinzer, Anna Hilsmann, Peter Eisert• 2025

Related benchmarks

Task	Dataset	Result
Perceptual Realism	FFHQ	FID3D4.94	16
Image Editing (Remove glasses)	FFHQ (test)	--	15
Unconditional 3D human head synthesis	FFQHC	FID3D4.52	8
Perceptual Realism	FFHQ-C	FID3D4.53	8
Facial Attribute Editing	FFHQ	--	6
Attribute Editing (Adding Glasses)	FFHQ	LPIPS Stability0.239	5
Multi-view Consistency for 3D Head Synthesis	FFHQ-C (100 avatars)	CD0.6724	2

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord