Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields

About

Fully unsupervised 3D representation learning has gained attention owing to its advantages in data collection. A successful approach involves a viewpoint-aware approach that learns an image distribution based on generative models (e.g., generative adversarial networks (GANs)) while generating various view images based on 3D-aware models (e.g., neural radiance fields (NeRFs)). However, they require images with various views for training, and consequently, their application to datasets with few or limited viewpoints remains a challenge. As a complementary approach, an aperture rendering GAN (AR-GAN) that employs a defocus cue was proposed. However, an AR-GAN is a CNN-based model and represents a defocus independently from a viewpoint change despite its high correlation, which is one of the reasons for its performance. As an alternative to an AR-GAN, we propose an aperture rendering NeRF (AR-NeRF), which can utilize viewpoint and defocus cues in a unified manner by representing both factors in a common ray-tracing framework. Moreover, to learn defocus-aware and defocus-independent representations in a disentangled manner, we propose aperture randomized training, for which we learn to generate images while randomizing the aperture size and latent codes independently. During our experiments, we applied AR-NeRF to various natural image datasets, including flower, bird, and face images, the results of which demonstrate the utility of AR-NeRF for unsupervised learning of the depth and defocus effects.

Takuhiro Kaneko• 2022

Related benchmarks

TaskDatasetResultRank
Image SynthesisFFHQ (test)
FID7.8
8
Image SynthesisOxford Flowers (test)
KID7.86
7
Image SynthesisCUB-200-2011 (test)
KID6.81
7
Unsupervised Depth and Defocus LearningOxford Flowers
KID7.86
4
Unsupervised Depth and Defocus LearningCUB-200 2011
KID6.81
4
Unsupervised Depth and Defocus LearningFFHQ
KID3.67
4
Depth PredictionOxford Flowers (test)
SIDE3.94
3
Depth PredictionFFHQ (test)
SIDE2.61
3
Depth PredictionCUB-200-2011 (test)
SIDE3.63
3
Showing 9 of 9 rows

Other info

Code

Follow for update