
RegionRoute: Regional Style Transfer with Diffusion Model

About

Precise spatial control in diffusion-based style transfer remains challenging. This challenge arises because diffusion models treat style as a global feature and lack explicit spatial grounding of style representations, making it difficult to restrict style application to specific objects or regions. To our knowledge, existing diffusion models are unable to perform true localized style transfer, typically relying on handcrafted masks or multi-stage post-processing that introduce boundary artifacts and limit generalization. To address this, we propose an attention-supervised diffusion framework that explicitly teaches the model where to apply a given style by aligning the attention scores of style tokens with object masks during training. Two complementary objectives, a Focus loss based on KL divergence and a Cover loss using binary cross-entropy, jointly encourage accurate localization and dense coverage. A modular LoRA-MoE design further enables efficient and scalable multi-style adaptation. To evaluate localized stylization, we introduce the Regional Style Editing Score, which measures Regional Style Matching through CLIP-based similarity within the target region and Identity Preservation via masked LPIPS and pixel-level consistency on unedited areas. Experiments show that our method achieves mask-free, single-object style transfer at inference, producing regionally accurate and visually coherent results that outperform existing diffusion-based editing approaches.
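The two training objectives named above can be illustrated with a small NumPy sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: the function names `focus_loss` and `cover_loss`, the direction of the KL term (mask distribution against normalized attention), and the use of a single 2D attention map with values in [0, 1] for one style token.

```python
import numpy as np

def focus_loss(attn, mask, eps=1e-8):
    """Assumed Focus loss: KL(p || q), where p is the mask normalized to a
    distribution over pixels and q is the attention map normalized likewise."""
    a = attn.ravel().astype(np.float64)
    m = mask.ravel().astype(np.float64)
    p = m / (m.sum() + eps)   # target: uniform over the object mask
    q = a / (a.sum() + eps)   # attention as a spatial distribution
    inside = p > 0            # terms with p == 0 contribute nothing to KL
    return float(np.sum(p[inside] * (np.log(p[inside]) - np.log(q[inside] + eps))))

def cover_loss(attn, mask, eps=1e-8):
    """Assumed Cover loss: per-pixel binary cross-entropy between the
    attention map (treated as a probability) and the binary mask."""
    a = np.clip(attn.astype(np.float64), eps, 1.0 - eps)
    m = mask.astype(np.float64)
    return float(np.mean(-(m * np.log(a) + (1.0 - m) * np.log(1.0 - a))))

# Toy 4x4 example: the object occupies the top-left 2x2 block.
mask = np.zeros((4, 4))
mask[:2, :2] = 1.0
attn_on_object = 0.9 * mask + 0.05    # attention concentrated on the object
attn_off_object = 1.0 - attn_on_object  # attention concentrated elsewhere

print(focus_loss(attn_on_object, mask), focus_loss(attn_off_object, mask))
print(cover_loss(attn_on_object, mask), cover_loss(attn_off_object, mask))
```

In this sketch the KL term only penalizes attention mass that is missing from the masked region, while the cross-entropy term scores every pixel, including those outside the mask, which is one way to read the abstract's pairing of "accurate localization" with "dense coverage". How the two terms are weighted and which attention layers they supervise is not stated in the abstract.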

Bowen Chen, Jake Zuena, Alan C. Bovik, Divya Kothandaraman • 2026

Related benchmarks

Task | Dataset | Metric | Result | Rank
Controllable Image Editing | COCO | Obj Target Style Adherence (%) | 73 | 7
Controllable Image Editing | Pascal VOC | Object Target Style Adherence Q1 | 74 | 7
Controllable Image Editing | BIG | Obj Style Adherence (Target) | 76 | 7
Regional Style Editing | COCO (test) | RSM | 0.6128 | 7
Regional Style Editing | Pascal VOC re-labeled (test) | RSM | 0.6147 | 7
Regional Style Editing | BIG (test) | RSM | 0.6159 | 7
