ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image

About

We introduce a 3D-aware diffusion model, ZeroNVS, for single-image novel view synthesis for in-the-wild scenes. While existing methods are designed for single objects with masked backgrounds, we propose new techniques to address challenges introduced by in-the-wild multi-object scenes with complex backgrounds. Specifically, we train a generative prior on a mixture of data sources that capture object-centric, indoor, and outdoor scenes. To address issues from data mixture such as depth-scale ambiguity, we propose a novel camera conditioning parameterization and normalization scheme. Further, we observe that Score Distillation Sampling (SDS) tends to truncate the distribution of complex backgrounds during distillation of 360-degree scenes, and propose "SDS anchoring" to improve the diversity of synthesized novel views. Our model sets a new state-of-the-art result in LPIPS on the DTU dataset in the zero-shot setting, even outperforming methods specifically trained on DTU. We further adapt the challenging Mip-NeRF 360 dataset as a new benchmark for single-image novel view synthesis, and demonstrate strong performance in this setting. Our code and data are at http://kylesargent.github.io/zeronvs/

Kyle Sargent, Zizhang Li, Tanmay Shah, Charles Herrmann, Hong-Xing Yu, Yunzhi Zhang, Eric Ryan Chan, Dmitry Lagun, Li Fei-Fei, Deqing Sun, Jiajun Wu • 2023

Related benchmarks

Task | Dataset | Result | Rank
Novel View Synthesis | Tanks&Temples (test) | PSNR 13.18 | 257
Novel View Synthesis | Mip-NeRF 360 (test) | PSNR 15.81 | 184
Novel View Synthesis | RealEstate10K | PSNR 23.73 | 173
Novel View Synthesis | Mip-NeRF360 | PSNR 15.99 | 138
Novel View Synthesis | LLFF | PSNR 18.79 | 130
Novel View Synthesis | DTU | PSNR 17.92 | 115
Novel View Synthesis | CO3D | PSNR 20.5 | 24
Novel View Synthesis | RealEstate10K Hard | PSNR 14.24 | 20
Novel View Synthesis | RealEstate10K Easy | PSNR 16.5 | 20
Few-view 3D Reconstruction | RealEstate10K (test) | PSNR 23.73 | 20
Showing 10 of 36 rows
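
The rows above report PSNR. As a reminder of what that number measures, here is a minimal sketch of the standard definition, PSNR = 10 * log10(MAX^2 / MSE), in plain Python. The function name `psnr`, the flat list representation of an image, and the [0, 1] pixel range are assumptions made for illustration; this page does not say which resolution, color space, or value range the benchmark entries used.

```python
import math


def psnr(reference, rendered, max_value=1.0):
    """Peak signal-to-noise ratio (dB) between two equal-length pixel sequences.

    Assumes pixel values lie in [0, max_value]; this range is an illustrative
    choice, not something stated by the benchmark listing.
    """
    if len(reference) != len(rendered):
        raise ValueError("images must have the same number of pixels")
    if not reference:
        raise ValueError("images must not be empty")
    # Mean squared error over all pixels.
    mse = sum((r - p) ** 2 for r, p in zip(reference, rendered)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# Toy example: every pixel is off by 0.1, so MSE is about 0.01.
reference = [0.0, 0.5, 1.0, 0.25]
rendered = [0.1, 0.4, 0.9, 0.35]
print(f"{psnr(reference, rendered):.2f} dB")  # prints "20.00 dB"
```

Higher is better, and because the scale is logarithmic, a gap of a few dB between rows corresponds to a large difference in mean squared error.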
