Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

G3Splat: Geometrically Consistent Generalizable Gaussian Splatting

About

3D Gaussians have recently emerged as an effective scene representation for real-time splatting and accurate novel-view synthesis, motivating several works to adapt multi-view structure prediction networks to regress per-pixel 3D Gaussians from images. However, most prior work extends these networks to predict additional Gaussian parameters -- orientation, scale, opacity, and appearance -- while relying almost exclusively on view-synthesis supervision. We show that a view-synthesis loss alone is insufficient to recover geometrically meaningful splats in this setting. We analyze and address the ambiguities of learning 3D Gaussian splats under self-supervision for pose-free generalizable splatting, and introduce G3Splat, which enforces geometric priors to obtain geometrically consistent 3D scene representations. Trained on RE10K, our approach achieves state-of-the-art performance in (i) geometrically consistent reconstruction, (ii) relative pose estimation, and (iii) novel-view synthesis. We further demonstrate strong zero-shot generalization on ScanNet, substantially outperforming prior work in both geometry recovery and relative pose estimation. Code and pretrained models are released on our project page (https://m80hz.github.io/g3splat/).

Mehdi Hosseinzadeh, Shin-Fang Chng, Yi Xu, Simon Lucey, Ian Reid, Ravi Garg• 2025

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationNYU V2
Delta 1 Acc0.434
113
Pose EstimationScanNet
AUC @ 5 deg14.8
41
Novel View SynthesisScanNet (test)
PSNR21.168
25
Novel View SynthesisACID (test)
PSNR23.827
18
Novel View SynthesisRE10K Small
PSNR21.377
12
Novel View SynthesisRE10K (Medium)
PSNR23.426
12
Novel View SynthesisRE10K (Average)
PSNR23.504
12
Novel View SynthesisRE10K Large
PSNR25.459
12
Pose EstimationRE10K
AUC @ 5°0.684
11
Pose EstimationACID
AUC @ 5°46.6
11
Showing 10 of 10 rows

Other info

Follow for update