PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
About
With the advent of portable 360{\deg} cameras, panorama has gained significant attention in applications like virtual reality (VR), virtual tours, robotics, and autonomous driving. As a result, wide-baseline panorama view synthesis has emerged as a vital task, where high resolution, fast inference, and memory efficiency are essential. Nevertheless, existing methods are typically constrained to lower resolutions (512 $\times$ 1024) due to demanding memory and computational requirements. In this paper, we present PanSplat, a generalizable, feed-forward approach that efficiently supports resolution up to 4K (2048 $\times$ 4096). Our approach features a tailored spherical 3D Gaussian pyramid with a Fibonacci lattice arrangement, enhancing image quality while reducing information redundancy. To accommodate the demands of high resolution, we propose a pipeline that integrates a hierarchical spherical cost volume and Gaussian heads with local operations, enabling two-step deferred backpropagation for memory-efficient training on a single A100 GPU. Experiments demonstrate that PanSplat achieves state-of-the-art results with superior efficiency and image quality across both synthetic and real-world datasets. Code is available at https://github.com/chengzhag/PanSplat.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Two-view reconstruction | Matterport3D (test) | WS-PSNR28.83 | 18 | |
| Two-view reconstruction | Replica (test) | WS-PSNR30.78 | 6 | |
| Two-view reconstruction | Residential (test) | WS-PSNR30.97 | 6 | |
| Novel View Synthesis | Matterport3D (train) | Training Time (s/iter)2.17 | 6 | |
| Two-view reconstruction | 360Loc | WS-PSNR28.24 | 5 | |
| Novel View Synthesis | Replica 20 (test) | PSNR31.821 | 4 | |
| Panoramic Novel View Synthesis | Matterport3D | Parameters (M)20.5 | 4 | |
| Panoramic View Synthesis | Matterport3D baseline 2.0m (test) | LRCE0.137 | 4 | |
| Panoramic View Synthesis | Matterport3D 1.0m baseline (test) | LRCE0.088 | 4 | |
| Depth Estimation | Matterport3D 2.0m baseline | AbsRel0.41 | 4 |