DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis
About
We present DiffuScene for indoor 3D scene synthesis based on a novel scene configuration denoising diffusion model. It generates 3D instance properties stored in an unordered object set and retrieves the most similar geometry for each object configuration, which is characterized as a concatenation of different attributes, including location, size, orientation, semantics, and geometry features. We introduce a diffusion network to synthesize a collection of 3D indoor objects by denoising a set of unordered object attributes. Unordered parametrization simplifies and eases the joint distribution approximation. The shape feature diffusion facilitates natural object placements, including symmetries. Our method enables many downstream applications, including scene completion, scene arrangement, and text-conditioned scene synthesis. Experiments on the 3D-FRONT dataset show that our method can synthesize more physically plausible and diverse indoor scenes than state-of-the-art methods. Extensive ablation studies verify the effectiveness of our design choice in scene diffusion models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Indoor Scene Synthesis | Bedroom (Standard Split) | CNR33.8 | 13 | |
| Text-to-scene generation | 3D-FRONT Bedroom (test) | FID129.3 | 10 | |
| Text-to-scene generation | 3D-FRONT Livingroom (test) | FID135.9 | 10 | |
| Text-to-scene generation | 3D-FRONT Diningroom (test) | FID142.4 | 10 | |
| Scene Rearrangement | 3D-FRONT Living room | KID2.24 | 8 | |
| Scene Rearrangement | 3D-FRONT Bedroom | KID1.02 | 8 | |
| 3D Indoor Scene Synthesis | Living Room (Standard Split) | OBR28 | 7 | |
| 3D Indoor Scene Synthesis | Avg. Bed + Living (Standard Split) | OBR37.1 | 7 | |
| Unconditional Scene Synthesis | 3D-FRONT Bedroom (test) | Precision82.31 | 5 | |
| Unconditional Scene Synthesis | 3D-FRONT Dining (test) | Precision82.8 | 5 |