SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent

About

Indoor scene synthesis has become increasingly important with the rise of Embodied AI, which requires 3D environments that are not only visually realistic but also physically plausible and functionally diverse. While recent approaches have advanced visual fidelity, they often remain constrained to fixed scene categories, lack sufficient object-level detail and physical consistency, and struggle to align with complex user instructions. In this work, we present SceneWeaver, a reflective agentic framework that unifies diverse scene synthesis paradigms through tool-based iterative refinement. At its core, SceneWeaver employs a language model-based planner to select from a suite of extensible scene generation tools, ranging from data-driven generative models to visual- and LLM-based methods, guided by self-evaluation of physical plausibility, visual realism, and semantic alignment with user input. This closed-loop reason-act-reflect design enables the agent to identify semantic inconsistencies, invoke targeted tools, and update the environment over successive iterations. Extensive experiments on both common and open-vocabulary room types demonstrate that SceneWeaver not only outperforms prior methods on physical, visual, and semantic metrics, but also generalizes effectively to complex scenes with diverse instructions, marking a step toward general-purpose 3D environment generation. Project website: https://scene-weaver.github.io/.
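The closed-loop reason-act-reflect design described above can be illustrated with a toy sketch. This is not SceneWeaver's implementation: the scene is reduced to a list of object names, the planner is a simple rule standing in for the language model, only two of the three evaluation axes (semantic alignment and physical plausibility) are modeled, and every name (`evaluate`, `plan`, `refine`, `add_missing`, `remove_duplicates`, `TOOLS`) is hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy types: a scene is just a list of object names.
Scene = List[str]
Scores = Dict[str, float]


def evaluate(scene: Scene, wanted: List[str]) -> Scores:
    """Toy self-evaluation; each score lies in [0, 1]."""
    # Semantic alignment: fraction of requested objects present in the scene.
    semantic = sum(obj in scene for obj in wanted) / len(wanted) if wanted else 1.0
    # Physical plausibility: duplicates stand in for colliding objects.
    physical = len(set(scene)) / len(scene) if scene else 1.0
    return {"semantic": semantic, "physical": physical}


def add_missing(scene: Scene, wanted: List[str]) -> Scene:
    """Stand-in for a generation tool: add one requested object that is absent."""
    missing = [obj for obj in wanted if obj not in scene]
    return scene + missing[:1]


def remove_duplicates(scene: Scene, wanted: List[str]) -> Scene:
    """Stand-in for a physics-fixing tool: drop repeated objects, keep order."""
    return list(dict.fromkeys(scene))


# Extensible tool registry: one tool per evaluation axis in this sketch.
TOOLS: Dict[str, Callable[[Scene, List[str]], Scene]] = {
    "semantic": add_missing,
    "physical": remove_duplicates,
}


def plan(scores: Scores) -> str:
    """Rule-based stand-in for the LLM planner: target the weakest axis."""
    return min(scores, key=scores.get)


def refine(scene: Scene, wanted: List[str], max_iters: int = 10) -> Tuple[Scene, List[str]]:
    """Iterate reflect -> reason -> act until every score reaches 1.0."""
    history: List[str] = []
    for _ in range(max_iters):
        scores = evaluate(scene, wanted)        # reflect
        if all(s >= 1.0 for s in scores.values()):
            break
        axis = plan(scores)                     # reason
        scene = TOOLS[axis](scene, wanted)      # act
        history.append(axis)
    return scene, history


if __name__ == "__main__":
    final_scene, steps = refine(["bed", "bed"], ["bed", "lamp", "desk"])
    print(final_scene, steps)
```

Registering another entry in `TOOLS` is the only change needed to add a tool in this sketch, which mirrors the extensibility claim in spirit; how the real planner selects and sequences tools is not specified on this page.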

Yandan Yang, Baoxiong Jia, Shujie Zhang, Siyuan Huang • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Indoor Scene Generation | 179 room-level prompts | Realism Win Rate: 91.7 | 12 |
| Indoor Scene Synthesis | User Study | Visual Quality: 3.42 | 8 |
| Visual Narrative Generation | CineBoard3D (test) | CIDS (Self): 0.872 | 7 |
| Scene editing | E2A-Bench | IF Score: 68.7 | 5 |
| 3D Scene Generation | Indoor Scenes 8 | Layout Correctness: 5.8 | 3 |
| Robot Manipulation | SAGE Generated Scenes (test) | Success Rate: 14.4 | 3 |
| Scene Generation | Common indoor scenes Kitchen | Object Count: 37.5 | 3 |
| Robot Manipulation | SceneWeaver Baseline 1 Generated Scenes (test) | Success Rate: 13.2 | 3 |
| Robot Manipulation | Holodeck Generated Scenes Baseline 2 (test) | Success Rate: 9.3 | 3 |
| Scene Generation | Common indoor scenes Bedroom | Object Count: 17.5 | 3 |
Showing 10 of 12 rows
