Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

About

3D immersive scene generation is a challenging yet critical task in computer vision and graphics. A desired virtual 3D scene should 1) exhibit omnidirectional view consistency, and 2) allow for free exploration in complex scene hierarchies. Existing methods either rely on successive scene expansion via inpainting or employ panorama representation to represent large FOV scene environments. However, the generated scene suffers from semantic drift during expansion and is unable to handle occlusion among scene hierarchies. To tackle these challenges, we introduce Layerpano3D, a novel framework for full-view, explorable panoramic 3D scene generation from a single text prompt. Our key insight is to decompose a reference 2D panorama into multiple layers at different depth levels, where each layer reveals the unseen space from the reference views via diffusion prior. Layerpano3D comprises multiple dedicated designs: 1) We introduce a new panorama dataset Upright360, comprising 9k high-quality and upright panorama images, and finetune the advanced Flux model on Upright360 for high-quality, upright and consistent panorama generation. 2) We pioneer the Layered 3D Panorama as underlying representation to manage complex scene hierarchies and lift it into 3D Gaussians to splat detailed 360-degree omnidirectional scenes with unconstrained viewing paths. Extensive experiments demonstrate that our framework generates state-of-the-art 3D panoramic scene in both full view consistency and immersive exploratory experience. We believe that Layerpano3D holds promise for advancing 3D panoramic scene creation with numerous applications.

Shuai Yang, Jing Tan, Mengchen Zhang, Tong Wu, Yixuan Li, Gordon Wetzstein, Ziwei Liu, Dahua Lin• 2024

Related benchmarks

TaskDatasetResultRank
Novel View SynthesisBlender
PSNR18.124
64
Text-to-3D Scene Generation60 indoor and outdoor samples (test)
BRISQUE33.119
7
Text-to-3D Scene GenerationText-to-3D generation evaluation (test)
IQA+0.3949
7
Text-to-Panorama GenerationStructured3D
CLIP Score30.95
7
3D Scene GenerationStructured3D (novel views)
IS2
4
Novel View SynthesisInfinigen Indoors
PSNR18.305
4
Novel View SynthesisInfinigen Outdoors
PSNR17.364
4
Novel View SynthesisInfinigen & Blender Average
PSNR17.931
4
Immersive Scene GenerationUser Study n=10 participants (15 video comparisons)
Visual Appeal Score1
4
Showing 9 of 9 rows

Other info

Follow for update