HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction
About
Reconstructing 3D scenes from multiple viewpoints is a fundamental task in stereo vision. Recently, advances in generalizable 3D Gaussian Splatting have enabled high-quality novel view synthesis for unseen scenes from sparse input views by feed-forward predicting per-pixel Gaussian parameters without extra optimization. However, existing methods typically generate single-scale 3D Gaussians, which lack representation of both large-scale structure and texture details, resulting in mislocation and artefacts. In this paper, we propose a novel framework, HiSplat, which introduces a hierarchical manner in generalizable 3D Gaussian Splatting to construct hierarchical 3D Gaussians via a coarse-to-fine strategy. Specifically, HiSplat generates large coarse-grained Gaussians to capture large-scale structures, followed by fine-grained Gaussians to enhance delicate texture details. To promote inter-scale interactions, we propose an Error Aware Module for Gaussian compensation and a Modulating Fusion Module for Gaussian repair. Our method achieves joint optimization of hierarchical representations, allowing for novel view synthesis using only two-view reference images. Comprehensive experiments on various datasets demonstrate that HiSplat significantly enhances reconstruction quality and cross-dataset generalization compared to prior single-scale methods. The corresponding ablation study and analysis of different-scale 3D Gaussians reveal the mechanism behind the effectiveness. Project website: https://open3dvlab.github.io/HiSplat/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | RealEstate10K | PSNR27.193 | 173 | |
| Novel View Synthesis | RE10K | SSIM87.2 | 142 | |
| Novel View Synthesis | ScanNet | PSNR19.095 | 130 | |
| Novel View Synthesis | DTU | PSNR16.019 | 115 | |
| Novel View Synthesis | DL3DV | PSNR26.242 | 84 | |
| Novel View Synthesis | ACID | PSNR28.737 | 71 | |
| Novel View Synthesis | Replica | PSNR17.157 | 69 | |
| Novel View Synthesis | T&T small-viewpoint set (O) | PSNR15.57 | 44 | |
| Novel View Synthesis | RE10K Small | PSNR12.38 | 38 | |
| New View Synthesis | T&T | LPIPS0.612 | 33 |