Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering

About

We propose a novel training-free image generation algorithm that precisely controls the occlusion relationships between objects in an image. Existing image generation methods typically rely on prompts to influence occlusion, which often lack precision. While layout-to-image methods provide control over object locations, they fail to address occlusion relationships explicitly. Given a pre-trained image diffusion model, our method leverages volume rendering principles to "render" the scene in latent space, guided by occlusion relationships and the estimated transmittance of objects. This approach does not require retraining or fine-tuning the image diffusion model, yet it enables accurate occlusion control due to its physics-grounded foundation. In extensive experiments, our method significantly outperforms existing approaches in terms of occlusion accuracy. Furthermore, we demonstrate that by adjusting the opacities of objects or concepts during rendering, our method can achieve a variety of effects, such as altering the transparency of objects, the density of mass (e.g., forests), the concentration of particles (e.g., rain, fog), the intensity of light, and the strength of lens effects, etc.

Xiaohang Zhan, Dingming Liu• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationT2I-CompBench++--
95
Layout-to-Image GenerationOcclBench
mIoU (Foreground)27.38
8
Layout-Grounded Image GenerationOverLayBench Complex SA-Z Eval (300 samples)
Occlusion58.38
7
Layout-Grounded Image GenerationOverLayBench Regular
mIoU0.5721
7
Layout-Grounded Image GenerationSA-Z Eval
mIoU40.53
7
Layout-Grounded Image GenerationOverLayBench Simple
mIoU0.6604
7
Layout-Grounded Image GenerationOverLayBench Complex
mIoU52.27
7
Occluded Text RenderingOccluded Text Rendering (evaluation set)
Peak VRAM Usage (GB)18
6
Occluded Text Rendering64 occluded-text rendering samples
Text Similarity21.17
6
Showing 9 of 9 rows

Other info

Follow for update