Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

About

With the advent of portable 360{\deg} cameras, panorama has gained significant attention in applications like virtual reality (VR), virtual tours, robotics, and autonomous driving. As a result, wide-baseline panorama view synthesis has emerged as a vital task, where high resolution, fast inference, and memory efficiency are essential. Nevertheless, existing methods are typically constrained to lower resolutions (512 $\times$ 1024) due to demanding memory and computational requirements. In this paper, we present PanSplat, a generalizable, feed-forward approach that efficiently supports resolution up to 4K (2048 $\times$ 4096). Our approach features a tailored spherical 3D Gaussian pyramid with a Fibonacci lattice arrangement, enhancing image quality while reducing information redundancy. To accommodate the demands of high resolution, we propose a pipeline that integrates a hierarchical spherical cost volume and Gaussian heads with local operations, enabling two-step deferred backpropagation for memory-efficient training on a single A100 GPU. Experiments demonstrate that PanSplat achieves state-of-the-art results with superior efficiency and image quality across both synthetic and real-world datasets. Code is available at https://github.com/chengzhag/PanSplat.

Cheng Zhang, Haofei Xu, Qianyi Wu, Camilo Cruz Gambardella, Dinh Phung, Jianfei Cai• 2024

Related benchmarks

TaskDatasetResultRank
Two-view reconstructionMatterport3D (test)
WS-PSNR28.83
18
Two-view reconstructionReplica (test)
WS-PSNR30.78
6
Two-view reconstructionResidential (test)
WS-PSNR30.97
6
Novel View SynthesisMatterport3D (train)
Training Time (s/iter)2.17
6
Two-view reconstruction360Loc
WS-PSNR28.24
5
Novel View SynthesisReplica 20 (test)
PSNR31.821
4
Panoramic Novel View SynthesisMatterport3D
Parameters (M)20.5
4
Panoramic View SynthesisMatterport3D baseline 2.0m (test)
LRCE0.137
4
Panoramic View SynthesisMatterport3D 1.0m baseline (test)
LRCE0.088
4
Depth EstimationMatterport3D 2.0m baseline
AbsRel0.41
4
Showing 10 of 22 rows

Other info

Follow for update