From Geometric Mimicry to Comprehensive Generation: A Context-Informed Multimodal Diffusion Model for Urban Morphology Synthesis
About
Urban morphology is fundamental to determining urban functionality and vitality. Prevailing simulation methods, however, often reduce morphological generation to a geometric problem and lack a deeper understanding of urban semantics and geographical context. To address this limitation, this study proposes ControlCity, a diffusion model that achieves comprehensive urban morphology generation through multimodal information fusion. We first construct a quadruple dataset of "image-text-metadata-building footprints" from 22 cities worldwide. ControlCity uses this multidimensional information as joint control conditions: an enhanced ControlNet architecture encodes spatial constraints from images, while text and metadata provide semantic guidance and geographical priors, respectively, and together they direct the generation process. Experimental results show that, compared with unimodal baselines, the method achieves significant gains in morphological fidelity: visual error (FID) is reduced by 71.01% to 50.94, and spatial overlap (MIoU) is improved by 38.46% to 0.36. Furthermore, the model demonstrates robust knowledge generalization and controllability, enabling cross-city style transfer and zero-shot generation for unseen cities. Ablation studies further reveal the distinct roles of images, text, and metadata in the generation process. This study confirms that multimodal fusion is crucial for the transition from "geometric mimicry" to "understanding-based comprehensive generation," providing a novel paradigm for urban morphology research and applications.
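To make the two headline numbers concrete, here is a minimal sketch of how a spatial-overlap score and the reported relative changes can be computed. It rests on assumptions the abstract does not state: that MIoU is the mean of per-sample intersection-over-union on binary footprint masks, and that the percentages are relative changes with respect to the unimodal baseline. The function names `iou`, `mean_iou`, and `implied_baseline` are hypothetical and are not taken from the ControlCity code.

```python
def iou(pred, target):
    """Intersection over union of two equally sized binary masks (lists of rows)."""
    inter = 0
    union = 0
    for row_p, row_t in zip(pred, target):
        for p, t in zip(row_p, row_t):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Two empty masks are treated as a perfect match (a convention, assumed here).
    return inter / union if union else 1.0


def mean_iou(pairs):
    """Average IoU over (prediction, target) mask pairs."""
    scores = [iou(p, t) for p, t in pairs]
    return sum(scores) / len(scores)


def implied_baseline(value, relative_change):
    """Invert value = baseline * (1 + relative_change)."""
    return value / (1.0 + relative_change)


# Toy 2x3 footprint masks: 2 overlapping cells, 4 cells in the union.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
toy_iou = iou(pred, target)

# Baselines implied by the abstract's figures under the stated assumption.
fid_baseline = implied_baseline(50.94, -0.7101)   # FID fell by 71.01% to 50.94
miou_baseline = implied_baseline(0.36, 0.3846)    # MIoU rose by 38.46% to 0.36
print(toy_iou, round(fid_baseline, 2), round(miou_baseline, 2))
```

Under these assumptions, the figures correspond to a unimodal baseline of roughly FID 175.7 and MIoU 0.26. These baseline values are back-computed here, not quoted from the paper.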
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Urban Morphology Generation | Ten Global Cities Average | FID 50.94 | 3 |
| Urban Morphology Generation | Beijing | FID 55.13 | 2 |
| Urban Morphology Generation | Frankfurt | FID 53.36 | 2 |
| Urban Morphology Generation | Jakarta | FID 74.38 | 2 |
| Urban Morphology Generation | London | FID 43.93 | 2 |
| Urban Morphology Generation | Los Angeles | FID 28.56 | 2 |
| Urban Morphology Generation | New York City | FID 37.81 | 2 |
| Urban Morphology Generation | Rotterdam | FID 76.24 | 2 |
| Urban Morphology Generation | Seattle | FID 47.8 | 2 |
| Urban Morphology Generation | Shanghai | FID 43.04 | 2 |
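
The per-city results can be summarized with a few lines of arithmetic. The sketch below copies the nine per-city FID values listed above and reports their unweighted mean together with the best and worst city (lower FID is better). Note that the listing shows nine cities while the reported average covers ten, so the mean computed here is not expected to match 50.94 exactly; whether the reported average is an unweighted mean is also an assumption.

```python
# Per-city FID values as listed in the benchmark table (lower is better).
fid_by_city = {
    "Beijing": 55.13,
    "Frankfurt": 53.36,
    "Jakarta": 74.38,
    "London": 43.93,
    "Los Angeles": 28.56,
    "New York City": 37.81,
    "Rotterdam": 76.24,
    "Seattle": 47.8,
    "Shanghai": 43.04,
}

# Unweighted mean over the nine listed cities only.
mean_listed = sum(fid_by_city.values()) / len(fid_by_city)

# Cities with the lowest and highest FID.
best_city = min(fid_by_city, key=fid_by_city.get)
worst_city = max(fid_by_city, key=fid_by_city.get)

print(f"mean FID over {len(fid_by_city)} listed cities: {mean_listed:.2f}")
print(f"best: {best_city} ({fid_by_city[best_city]})")
print(f"worst: {worst_city} ({fid_by_city[worst_city]})")
```

The nine listed cities average about 51.14, close to the reported ten-city figure of 50.94, with Los Angeles scoring best and Rotterdam worst.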