
Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation

About

Balancing content fidelity and artistic style is a pivotal challenge in image generation. While traditional style transfer methods and modern Denoising Diffusion Probabilistic Models (DDPMs) strive to achieve this balance, they often sacrifice style, content, or sometimes both. This work addresses this challenge by analyzing the ability of DDPMs to maintain content and style equilibrium. We introduce a novel method to identify sensitivities within the DDPM attention layers, pinpointing the specific layers that correspond to different stylistic aspects. By directing conditional inputs only to these sensitive layers, our approach enables fine-grained control over style and content, significantly reducing issues arising from over-constrained inputs. Our findings demonstrate that this method enhances recent stylization techniques by better aligning style and content, ultimately improving the quality of generated visual content.
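The idea of routing a condition only to the most sensitive attention layers can be illustrated with a small sketch. It is not the authors' implementation. The layer names, the sensitivity scores, and both helper functions (`select_sensitive_layers`, `route_conditioning`) are hypothetical, and the abstract does not say how the sensitivities are measured, so the scores are simply taken as given.

```python
from typing import Dict, List


def select_sensitive_layers(scores: Dict[str, float], k: int) -> List[str]:
    """Return the k layer names with the highest sensitivity score.

    Hypothetical helper: the scores are assumed to come from some
    separate sensitivity analysis that is not shown here.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    ranked = sorted(scores, key=lambda name: scores[name], reverse=True)
    return ranked[:k]


def route_conditioning(layers: List[str], sensitive: List[str]) -> Dict[str, bool]:
    """Map each layer to True if it should receive the style condition."""
    chosen = set(sensitive)
    return {name: name in chosen for name in layers}


# Made-up scores for illustration only; they are not results from the paper.
scores = {"down.0": 0.05, "down.1": 0.12, "mid.0": 0.40, "up.0": 0.71, "up.1": 0.33}

sensitive = select_sensitive_layers(scores, k=2)
routing = route_conditioning(list(scores), sensitive)

for name, gets_style in routing.items():
    print(f"{name}: {'style condition' if gets_style else 'no style condition'}")
```

In this toy setup only the two highest-scoring layers receive the style condition, and the rest are left unconstrained. This mirrors the abstract's point about avoiding over-constrained inputs.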

Nadav Z. Cohen, Oron Nir, Ariel Shamir • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Style-Driven Image Generation | SDXL Complex Prompts 1.0 (test) | Content Score: 0.364 | 21 |
| Style-Driven Image Generation | SDXL Easy Prompts 1.0 (test) | Content: 0.308 | 21 |
| Style-Driven Image Generation | SDXL Easy + Complex Averaged 1.0 (test) | Content Score: 0.329 | 7 |
| Qualitative User Preference Evaluation | User Study Multi-choice 1.0 (test) | Observed Votes: 386 | 2 |
| Qualitative User Preference Evaluation | User Study A/B B-LoRA 1.0 (test) | Observed Votes: 195 | 2 |
| Qualitative User Preference Evaluation | User Study A/B Test StyleAligned 1.0 (test) | Observed Votes: 185 | 2 |
