Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction

About

Feed-forward 3D Gaussian Splatting (3DGS) has emerged as a highly effective solution for novel view synthesis. Existing methods predominantly rely on a \emph{pixel-aligned} Gaussian prediction paradigm, where each 2D pixel is mapped to a 3D Gaussian. We rethink this widely adopted formulation and identify several inherent limitations: it renders the reconstructed 3D models heavily dependent on the number of input views, leads to view-biased density distributions, and introduces alignment errors, particularly when source views contain occlusions or low texture. To address these challenges, we introduce VolSplat, a new multi-view feed-forward paradigm that replaces pixel alignment with voxel-aligned Gaussians. By directly predicting Gaussians from a predicted 3D voxel grid, it overcomes pixel alignment's reliance on error-prone 2D feature matching, ensuring robust multi-view consistency. Furthermore, it enables adaptive control over density based on 3D scene complexity, yielding more faithful Gaussians, improved geometric consistency, and enhanced novel-view rendering quality. Experiments on widely used benchmarks demonstrate that VolSplat achieves state-of-the-art performance, while producing more plausible and view-consistent results. The video results, code and trained models are available on our project page: https://lhmd.top/volsplat.

Weijie Wang, Yeqing Chen, Zeyu Zhang, Hengyu Liu, Haoxiao Wang, Zhiyuan Feng, Wenkang Qin, Feng Chen, Zheng Zhu, Donny Y. Chen, Bohan Zhuang• 2025

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisTanks&Temples (test)
PSNR20.61
289
Novel View SynthesisRE10K
SSIM94.1
161
Novel View SynthesisACID
PSNR32.65
71
Novel View SynthesisReplica (test)
PSNR28.66
67
GS Depth RenderingTAT Dataset
RMSE0.294
54
GS Depth RenderingReplica Dataset
RMSE0.149
54
GS Depth RenderingDTU Dataset
RMSE0.032
54
Novel View SynthesisTAT Dataset (test)
PSNR17.78
30
Novel View SynthesisRealEstate10K 12 view
PSNR29.4
13
Novel View SynthesisRealEstate10K 6 views (test)
PSNR31.3
8
Showing 10 of 13 rows

Other info

Follow for update