GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
About
We present GaussFusion, a novel approach for improving 3D Gaussian splatting (3DGS) reconstructions in the wild through geometry-informed video generation. GaussFusion mitigates common 3DGS artifacts, including floaters, flickering, and blur caused by camera pose errors, incomplete coverage, and noisy geometry initialization. Unlike prior RGB-based approaches limited to a single reconstruction pipeline, our method introduces a geometry-informed video-to-video generator that refines 3DGS renderings across both optimization-based and feed-forward methods. Given an existing reconstruction, we render a Gaussian primitive video buffer encoding depth, normals, opacity, and covariance, which the generator refines to produce temporally coherent, artifact-free frames. We further introduce an artifact synthesis pipeline that simulates diverse degradation patterns, ensuring robustness and generalization. GaussFusion achieves state-of-the-art performance on novel-view synthesis benchmarks, and an efficient variant runs in real time at 15 FPS while maintaining similar performance, enabling interactive 3D applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | Re10K (test) | PSNR28.652 | 79 | |
| Novel View Synthesis | DL3DV (test) | PSNR22.548 | 61 | |
| 3D Scene Reconstruction | Re10K (test) | LPIPS17.5 | 15 | |
| 3D Scene Reconstruction | DL3DV (test) | LPIPS0.279 | 14 | |
| Novel View Synthesis | RE10K official (test) | PSNR22.802 | 9 |