Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

G3Splat: Geometrically Consistent Generalizable Gaussian Splatting

About

3D Gaussians have recently emerged as an effective scene representation for real-time splatting and accurate novel-view synthesis, motivating several works to adapt multi-view structure prediction networks to regress per-pixel 3D Gaussians from images. However, most prior work extends these networks to predict additional Gaussian parameters -- orientation, scale, opacity, and appearance -- while relying almost exclusively on view-synthesis supervision. We show that a view-synthesis loss alone is insufficient to recover geometrically meaningful splats in this setting. We analyze and address the ambiguities of learning 3D Gaussian splats under self-supervision for pose-free generalizable splatting, and introduce G3Splat, which enforces geometric priors to obtain geometrically consistent 3D scene representations. Trained on RE10K, our approach achieves state-of-the-art performance in (i) geometrically consistent reconstruction, (ii) relative pose estimation, and (iii) novel-view synthesis. We further demonstrate strong zero-shot generalization on ScanNet, substantially outperforming prior work in both geometry recovery and relative pose estimation. Code and pretrained models are released on our project page (https://m80hz.github.io/g3splat/).

Mehdi Hosseinzadeh, Shin-Fang Chng, Yi Xu, Simon Lucey, Ian Reid, Ravi Garg• 2025

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationNYU V2
Delta 1 Acc0.434
131
Pose EstimationScanNet
AUC @ 5 deg14.8
41
Novel View SynthesisACID (test)
PSNR23.827
39
Novel View SynthesisRE10K Small
PSNR21.377
38
Pose EstimationRE10K
AUC @ 5°0.684
35
Novel View SynthesisScanNet (test)
PSNR21.168
34
Novel View SynthesisRE10K (Medium)
PSNR23.426
33
Novel View SynthesisRE10K (Average)
PSNR23.504
33
Novel View SynthesisRE10K Large
PSNR25.459
25
Pose EstimationACID
AUC @ 5°46.6
23
Showing 10 of 10 rows

Other info

Follow for update