Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SynCity: Training-Free Generation of 3D Worlds

About

We address the challenge of generating 3D worlds from textual descriptions. We propose SynCity, a training- and optimization-free approach, which leverages the geometric precision of pre-trained 3D generative models and the artistic versatility of 2D image generators to create large, high-quality 3D spaces. While most 3D generative models are object-centric and cannot generate large-scale worlds, we show how 3D and 2D generators can be combined to generate ever-expanding scenes. Through a tile-based approach, we allow fine-grained control over the layout and the appearance of scenes. The world is generated tile-by-tile, and each new tile is generated within its world-context and then fused with the scene. SynCity generates compelling and immersive scenes that are rich in detail and diversity.

Paul Engstler, Aleksandar Shtedritski, Iro Laina, Christian Rupprecht, Andrea Vedaldi• 2025

Related benchmarks

TaskDatasetResultRank
3D City Generation100 textual descriptions of cities
VQA Score0.6975
8
3D GenerationComputational Cost Evaluation
VRAM (GB)49
5
3D City GenerationTypical City Size (test)
VQAScore69.75
3
3D Scene GenerationChatGPT-generated scene prompts
CLIP Score0.251
2
Showing 4 of 4 rows

Other info

Follow for update