Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction

About

We introduce pixelSplat, a feed-forward model that learns to reconstruct 3D radiance fields parameterized by 3D Gaussian primitives from pairs of images. Our model features real-time and memory-efficient rendering for scalable training as well as fast 3D reconstruction at inference time. To overcome local minima inherent to sparse and locally supported representations, we predict a dense probability distribution over 3D and sample Gaussian means from that probability distribution. We make this sampling operation differentiable via a reparameterization trick, allowing us to back-propagate gradients through the Gaussian splatting representation. We benchmark our method on wide-baseline novel view synthesis on the real-world RealEstate10k and ACID datasets, where we outperform state-of-the-art light field transformers and accelerate rendering by 2.5 orders of magnitude while reconstructing an interpretable and editable 3D radiance field.

David Charatan, Sizhe Li, Andrea Tagliasacchi, Vincent Sitzmann• 2023

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisRealEstate10K
PSNR26.09
173
Novel View SynthesisRE10K
SSIM80.8
142
Monocular Depth EstimationNYU V2
Delta 1 Acc0.138
131
Novel View SynthesisLLFF
PSNR22.99
130
Novel View SynthesisScanNet
PSNR19.606
130
Novel View SynthesisDTU
PSNR15.067
115
Novel View SynthesisDL3DV
PSNR27.201
84
Novel View SynthesisACID
PSNR28.284
71
Novel View SynthesisReplica
PSNR26.28
69
Novel View SynthesisScanNet++
PSNR18.434
67
Showing 10 of 93 rows
...

Other info

Follow for update