Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator

About

We present GaussFusion, a novel approach for improving 3D Gaussian splatting (3DGS) reconstructions in the wild through geometry-informed video generation. GaussFusion mitigates common 3DGS artifacts, including floaters, flickering, and blur caused by camera pose errors, incomplete coverage, and noisy geometry initialization. Unlike prior RGB-based approaches limited to a single reconstruction pipeline, our method introduces a geometry-informed video-to-video generator that refines 3DGS renderings across both optimization-based and feed-forward methods. Given an existing reconstruction, we render a Gaussian primitive video buffer encoding depth, normals, opacity, and covariance, which the generator refines to produce temporally coherent, artifact-free frames. We further introduce an artifact synthesis pipeline that simulates diverse degradation patterns, ensuring robustness and generalization. GaussFusion achieves state-of-the-art performance on novel-view synthesis benchmarks, and an efficient variant runs in real time at 15 FPS while maintaining similar performance, enabling interactive 3D applications.

Liyuan Zhu, Manjunath Narayana, Michal Stary, Will Hutchcroft, Gordon Wetzstein, Iro Armeni• 2026

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisRe10K (test)
PSNR28.652
79
Novel View SynthesisDL3DV (test)
PSNR22.548
61
3D Scene ReconstructionRe10K (test)
LPIPS17.5
15
3D Scene ReconstructionDL3DV (test)
LPIPS0.279
14
Novel View SynthesisRE10K official (test)
PSNR22.802
9
Showing 5 of 5 rows

Other info

Follow for update