Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

R$^3$L: Reasoning 3D Layouts from Relative Spatial Relations

About

Relative spatial relations provide a compact representation of spatial structure and are fundamental to relative spatial reasoning in 3D layout generation. Recent works leverage Multimodal Large Language Models (MLLMs) to infer such relations, but the inferred relations are often unreliable and are typically handled with post-hoc heuristics. In this paper, we propose R$^3$L, a general framework that improves the reliability and consistency of relative spatial reasoning for 3D layout generation. Our key motivation is that multi-hop reasoning requires repeated reference-frame transformations, which accumulate errors in inferred relations and lead to semantic and metric drift. To mitigate this, we propose invariant spatial decomposition to break coupled relation chains, and consistent spatial imagination to promote self-consistency through an imagine-and-revise loop. We further introduce supportive spatial optimization to ease pose optimization via global-to-local coordinate re-parameterization. Extensive experiments across diverse scene types and instructions demonstrate that R$^3$L produces more physically feasible and semantically consistent layouts. Notably, our analysis shows that resolving frame-induced inconsistencies is crucial for reliable multi-hop relative spatial reasoning. The code is available at https://github.com/Neal2020GitHub/R3L.

Zhifeng Gu, Yuqi Wang, Bing Wang• 2026

Related benchmarks

TaskDatasetResultRank
3D Layout GenerationUser Study 150 participants
Realism Score6.7
4
3D Layout GenerationUser Study (test)
Elo Rating1.91e+3
4
3D Layout Generation3D Layout Generation Dataset Game Room
CLIP Score25.32
4
3D Layout Generation3D Layout Generation Dataset Kitchen
CLIP Score25.07
4
3D Layout Generation3D Layout Generation Dataset
CLIP Score26.67
4
Instruction-driven 3D layout generation27 tasks across 9 scene types
Elo Rating1.87e+3
4
3D Layout Generation3D Layout Generation Dataset (Bathroom)
CLIP Score24.26
4
3D Layout Generation3D Layout Generation Dataset (Bookstore)
CLIP Score21.55
4
3D Layout Generation3D Layout Generation Dataset (Gym)
CLIP Score30.5
4
3D Layout Generation3D Layout Generation Dataset Living Room
CLIP Score25.95
4
Showing 10 of 25 rows

Other info

Follow for update