Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

About

Although subject-driven generation has been extensively explored in image generation due to its wide applications, it still has challenges in data scalability and subject expansibility. For the first challenge, moving from curating single-subject datasets to multiple-subject ones and scaling them is particularly difficult. For the second, most recent methods center on single-subject generation, making it hard to apply when dealing with multi-subject scenarios. In this study, we propose a highly-consistent data synthesis pipeline to tackle this challenge. This pipeline harnesses the intrinsic in-context generation capabilities of diffusion transformers and generates high-consistency multi-subject paired data. Additionally, we introduce UNO, which consists of progressive cross-modal alignment and universal rotary position embedding. It is a multi-image conditioned subject-to-image model iteratively trained from a text-to-image model. Extensive experiments show that our method can achieve high consistency while ensuring controllability in both single-subject and multi-subject driven generation.

Shaojin Wu, Mengqi Huang, Wenxu Wu, Yufeng Cheng, Fei Ding, Qian He• 2025

Related benchmarks

TaskDatasetResultRank
Subject-driven image generationDreamBench
DINO Score76
100
Instructive image editingMagicBrush (test)
CLIP Image0.9236
37
Subject-driven generationDreamBench
DINO Score0.76
28
Cinematic Story GenerationViStoryBench
CSD (Cross)0.391
24
Multi-image ReasoningOmniContext
Single Scene Char Score7.15
20
Personalized Text-to-Image GenerationDreamBench++ Single-subject
CP0.721
18
Virtual Try-On and AnimationViViD Dataset
L10.2125
18
Virtual Try-On and AnimationInternet Dataset
L1 Loss0.1774
18
Multi-image context generationMICON-Bench
Object Score62.3
18
Identity-preserving Image GenerationMultiID-Bench 1-people
Sim(GT)0.304
18
Showing 10 of 41 rows

Other info

Follow for update