Efficient Geometry-aware 3D Generative Adversarial Networks

About

Unsupervised generation of high-quality multi-view-consistent images and 3D shapes using only collections of single-view 2D photographs has been a long-standing challenge. Existing 3D GANs are either compute-intensive or make approximations that are not 3D-consistent; the former limits quality and resolution of the generated images and the latter adversely affects multi-view consistency and shape quality. In this work, we improve the computational efficiency and image quality of 3D GANs without overly relying on these approximations. We introduce an expressive hybrid explicit-implicit network architecture that, together with other design choices, synthesizes not only high-resolution multi-view-consistent images in real time but also produces high-quality 3D geometry. By decoupling feature generation and neural rendering, our framework is able to leverage state-of-the-art 2D CNN generators, such as StyleGAN2, and inherit their efficiency and expressiveness. We demonstrate state-of-the-art 3D-aware synthesis with FFHQ and AFHQ Cats, among other experiments.

Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein• 2021

Related benchmarks

Task	Dataset	Result
3D Scene Representation	Multi-Object Scalability	Memory Footprint (GB)1.5	40
3D Scene Reconstruction	ShapeNet cars	Total Training Time (days)44.7	40
Unconditional image synthesis	FFHQ 256x256 (test)	FID4.8	31
Image Synthesis	FFHQ	FID4.8	16
Perceptual Realism	FFHQ	FID3D4.7	16
Novel View Synthesis	Basel Faces	PSNR36.47	14
Rendering	FFHQ	Total Rendering Time (ms)27	13
Unconditional image synthesis	AFHQ 256x256 (test)	FID3.9	12
3D-aware head synthesis	FFHQ	FID3.28	10
3D-aware Image Synthesis	Cats (test)	FID5.56	9

Showing 10 of 54 rows

Other info

Follow for update

@wizwand_team Discord