Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DepthSplat: Connecting Gaussian Splatting and Depth

About

Gaussian splatting and single-view depth estimation are typically studied in isolation. In this paper, we present DepthSplat to connect Gaussian splatting and depth estimation and study their interactions. More specifically, we first contribute a robust multi-view depth model by leveraging pre-trained monocular depth features, leading to high-quality feed-forward 3D Gaussian splatting reconstructions. We also show that Gaussian splatting can serve as an unsupervised pre-training objective for learning powerful depth models from large-scale multi-view posed datasets. We validate the synergy between Gaussian splatting and depth estimation through extensive ablation and cross-task transfer experiments. Our DepthSplat achieves state-of-the-art performance on ScanNet, RealEstate10K and DL3DV datasets in terms of both depth estimation and novel view synthesis, demonstrating the mutual benefits of connecting both tasks. In addition, DepthSplat enables feed-forward reconstruction from 12 input views (512x960 resolutions) in 0.6 seconds.

Haofei Xu, Songyou Peng, Fangjinhua Wang, Hermann Blum, Daniel Barath, Andreas Geiger, Marc Pollefeys• 2024

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisRealEstate10K
PSNR27.61
173
Novel View SynthesisRE10K
SSIM87.8
142
Novel View SynthesisMip-NeRF360
PSNR13.85
138
Monocular Depth EstimationNYU V2
Delta 1 Acc0.619
131
Novel View SynthesisScanNet
PSNR20.201
130
Novel View SynthesisLLFF
PSNR17.64
130
Novel View SynthesisDTU
PSNR15.59
115
Novel View SynthesisLLFF (test)
PSNR13.9
91
Novel View SynthesisDL3DV
PSNR28.141
84
Novel View SynthesisRe10K (test)
PSNR19.53
79
Showing 10 of 130 rows
...

Other info

Code

Follow for update