Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts

About

Controllability plays a crucial role in the practical applications of 3D indoor scene synthesis. Existing works either allow rough language-based control, that is convenient but lacks fine-grained scene customization, or employ graph based control, which offers better controllability but demands considerable knowledge for the cumbersome graph design process. To address these challenges, we present FreeScene, a user-friendly framework that enables both convenient and effective control for indoor scene synthesis.Specifically, FreeScene supports free-form user inputs including text description and/or reference images, allowing users to express versatile design intentions. The user inputs are adequately analyzed and integrated into a graph representation by a VLM-based Graph Designer. We then propose MG-DiT, a Mixed Graph Diffusion Transformer, which performs graph-aware denoising to enhance scene generation. Our MG-DiT not only excels at preserving graph structure but also offers broad applicability to various tasks, including, but not limited to, text-to-scene, graph-to-scene, and rearrangement, all within a single model. Extensive experiments demonstrate that FreeScene provides an efficient and user-friendly solution that unifies text-based and graph based scene synthesis, outperforming state-of-the-art methods in terms of both generation quality and controllability in a range of applications.

Tongyuan Bai, Wangyuanfan Bai, Dong Chen, Tieru Wu, Manyi Li, Rui Ma• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-scene generation3D-FRONT Bedroom (test)
FID108
10
Text-to-scene generation3D-FRONT Livingroom (test)
FID108.2
10
Text-to-scene generation3D-FRONT Diningroom (test)
FID125.3
10
Indoor Scene CompletionBedroom (test)
FID81.83
4
Indoor Scene CompletionLivingroom (test)
FID91.01
4
Indoor Scene CompletionDiningroom (test)
FID105.7
4
Physical Plausibility AssessmentBedroom
Total Intersection Volume241.3
4
Physical Plausibility AssessmentLiving room
Total Intersection Volume472.7
4
Physical Plausibility AssessmentDining room
Total Intersection Volume367.7
4
StylizationIndoor Scenes Living
Delta (x 1e-3)0.38
4
Showing 10 of 39 rows

Other info

Code

Follow for update