LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes
About
With the widespread usage of VR devices and contents, demands for 3D scene generation techniques become more popular. Existing 3D scene generation models, however, limit the target scene to specific domain, primarily due to their training strategies using 3D scan dataset that is far from the real-world. To address such limitation, we propose LucidDreamer, a domain-free scene generation pipeline by fully leveraging the power of existing large-scale diffusion-based generative model. Our LucidDreamer has two alternate steps: Dreaming and Alignment. First, to generate multi-view consistent images from inputs, we set the point cloud as a geometrical guideline for each image generation. Specifically, we project a portion of point cloud to the desired view and provide the projection as a guidance for inpainting using the generative model. The inpainted images are lifted to 3D space with estimated depth maps, composing a new points. Second, to aggregate the new points into the 3D scene, we propose an aligning algorithm which harmoniously integrates the portions of newly generated 3D scenes. The finally obtained 3D scene serves as initial points for optimizing Gaussian splats. LucidDreamer produces Gaussian splats that are highly-detailed compared to the previous 3D scene generation methods, with no constraint on domain of the target scene. Project page: https://luciddreamer-cvlab.github.io/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | RealEstate10K | PSNR15.24 | 116 | |
| 3D Scene Generation | WorldScore | Camera Control88.93 | 33 | |
| Novel View Synthesis | CO3D | PSNR13.9 | 24 | |
| Novel View Synthesis | RealEstate10K Hard | PSNR14.13 | 20 | |
| Novel View Synthesis | RealEstate10K Easy | PSNR16.35 | 20 | |
| Novel View Synthesis | DTU Far-view (test) | PSNR12.96 | 14 | |
| Novel View Synthesis | DTU Near-view (test) | PSNR12.09 | 14 | |
| Text-to-3D Scene Generation | 60 indoor and outdoor samples (test) | BRISQUE48.08 | 7 | |
| Novel View Synthesis | Tanks-and-Temples (Easy set) | LPIPS0.413 | 6 | |
| Novel View Synthesis | Tanks-and-Temples (Hard set) | LPIPS0.558 | 6 |