Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DepthSplat: Connecting Gaussian Splatting and Depth

About

Gaussian splatting and single-view depth estimation are typically studied in isolation. In this paper, we present DepthSplat to connect Gaussian splatting and depth estimation and study their interactions. More specifically, we first contribute a robust multi-view depth model by leveraging pre-trained monocular depth features, leading to high-quality feed-forward 3D Gaussian splatting reconstructions. We also show that Gaussian splatting can serve as an unsupervised pre-training objective for learning powerful depth models from large-scale multi-view posed datasets. We validate the synergy between Gaussian splatting and depth estimation through extensive ablation and cross-task transfer experiments. Our DepthSplat achieves state-of-the-art performance on ScanNet, RealEstate10K and DL3DV datasets in terms of both depth estimation and novel view synthesis, demonstrating the mutual benefits of connecting both tasks. In addition, DepthSplat enables feed-forward reconstruction from 12 input views (512x960 resolutions) in 0.6 seconds.

Haofei Xu, Songyou Peng, Fangjinhua Wang, Hermann Blum, Daniel Barath, Andreas Geiger, Marc Pollefeys• 2024

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisLLFF
PSNR17.64
124
Novel View SynthesisRealEstate10K
PSNR27.504
116
Monocular Depth EstimationNYU V2
Delta 1 Acc0.619
113
Novel View SynthesisMip-NeRF360
PSNR13.85
104
Novel View SynthesisDTU
PSNR15.59
100
Novel View SynthesisRe10K (test)
PSNR19.53
66
Novel View SynthesisDL3DV
PSNR28.141
61
Novel View SynthesisScanNet
PSNR20.201
58
Novel View SynthesisDL3DV (test)
PSNR27.66
54
Novel View SynthesisReplica
PSNR19.369
39
Showing 10 of 49 rows

Other info

Code

Follow for update