GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation
About
3D-aware image generative modeling aims to generate 3D-consistent images with explicitly controllable camera poses. Recent works have shown promising results by training neural radiance field (NeRF) generators on unstructured 2D images, but still can not generate highly-realistic images with fine details. A critical reason is that the high memory and computation cost of volumetric representation learning greatly restricts the number of point samples for radiance integration during training. Deficient sampling not only limits the expressive power of the generator to handle fine details but also impedes effective GAN training due to the noise caused by unstable Monte Carlo sampling. We propose a novel approach that regulates point sampling and radiance field learning on 2D manifolds, embodied as a set of learned implicit surfaces in the 3D volume. For each viewing ray, we calculate ray-surface intersections and accumulate their radiance generated by the network. By training and rendering such radiance manifolds, our generator can produce high quality images with realistic fine details and strong visual 3D consistency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Unconditional image synthesis | FFHQ 256x256 (test) | FID17.9 | 31 | |
| Image Generation | FFHQ 256x256 (train) | FID14.5 | 20 | |
| Image Synthesis | FFHQ | FID17.9 | 16 | |
| Rendering | FFHQ | Total Rendering Time (ms)136 | 13 | |
| Unconditional image synthesis | AFHQ 256x256 (test) | FID18.5 | 12 | |
| Image Generation | CARLA 128 x 128 (test) | FID26.3 | 9 | |
| Image Synthesis | AFHQ (full dataset) | FID18.5 | 8 | |
| Image Synthesis | Carla (full dataset) | FID26.3 | 7 | |
| Image Generation | Cats 256x256 256^2 (train) | FID14.6 | 5 | |
| Image Generation | CARLA 128^2 (train) | FID26.3 | 5 |