Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Depth-supervised NeRF: Fewer Views and Faster Training for Free

About

A commonly observed failure mode of Neural Radiance Field (NeRF) is fitting incorrect geometries when given an insufficient number of input views. One potential reason is that standard volumetric rendering does not enforce the constraint that most of a scene's geometry consist of empty space and opaque surfaces. We formalize the above assumption through DS-NeRF (Depth-supervised Neural Radiance Fields), a loss for learning radiance fields that takes advantage of readily-available depth supervision. We leverage the fact that current NeRF pipelines require images with known camera poses that are typically estimated by running structure-from-motion (SFM). Crucially, SFM also produces sparse 3D points that can be used as "free" depth supervision during training: we add a loss to encourage the distribution of a ray's terminating depth matches a given 3D keypoint, incorporating depth uncertainty. DS-NeRF can render better images given fewer training views while training 2-3x faster. Further, we show that our loss is compatible with other recently proposed NeRF methods, demonstrating that depth is a cheap and easily digestible supervisory signal. And finally, we find that DS-NeRF can support other types of depth supervision such as scanned depth sensors and RGB-D reconstruction outputs.

Kangle Deng, Andrew Liu, Jun-Yan Zhu, Deva Ramanan• 2021

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisRealEstate10K
PSNR26.65
173
Novel View SynthesisDTU 6-view
PSNR20.6
58
Novel View SynthesisDTU 3-view
PSNR16.9
58
Novel View SynthesisDTU (val)
PSNR (full)22.3
43
Novel View SynthesisLLFF 3-view (test)
PSNR20.2
39
Novel View SynthesisRealEstate-10K 2-view
PSNR25.44
32
Novel View SynthesisDTU 9-view
PSNR22.3
31
Novel View SynthesisNeRF-LLFF
LPIPS0.2979
30
Novel View SynthesisLLFF 4 input views (test)
LPIPS0.2979
20
View SynthesisRedwood-3dscan (test)
PSNR23.9
19
Showing 10 of 74 rows
...

Other info

Code

Follow for update