Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation

About

Layout-to-image generation refers to the task of synthesizing photo-realistic images based on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational diffusion model pretrained on large-scale image or text-image datasets for layout-to-image generation. By adopting a novel neural adaptor based on layout attention and task-aware prompts, our method trains efficiently, generates images with both high perceptual quality and layout alignment, and needs less data. Experiments on three datasets show that our method significantly outperforms other 10 generative models based on GANs, VQ-VAE, and diffusion models.

Jiaxin Cheng, Xiao Liang, Xingjian Shi, Tong He, Tianjun Xiao, Mu Li• 2023

Related benchmarks

TaskDatasetResultRank
Object DetectionMS-COCO
AP36.6
77
Layout-to-Image GenerationCOCO 3K (test)
FID20.27
5
Showing 2 of 2 rows

Other info

Follow for update