Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

About

Text-driven style transfer aims to merge the style of a reference image with content described by a text prompt. Recent advancements in text-to-image models have improved the nuance of style transformations, yet significant challenges remain, particularly with overfitting to reference styles, limiting stylistic control, and misaligning with textual content. In this paper, we propose three complementary strategies to address these issues. First, we introduce a cross-modal Adaptive Instance Normalization (AdaIN) mechanism for better integration of style and text features, enhancing alignment. Second, we develop a Style-based Classifier-Free Guidance (SCFG) approach that enables selective control over stylistic elements, reducing irrelevant influences. Finally, we incorporate a teacher model during early generation stages to stabilize spatial layouts and mitigate artifacts. Our extensive evaluations demonstrate significant improvements in style transfer quality and alignment with textual prompts. Furthermore, our approach can be integrated into existing style transfer frameworks without fine-tuning.

Mingkun Lei, Xue Song, Beier Zhu, Hao Wang, Chi Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Stylized GenerationStyleBench
CLIP TA0.23
9
Text-driven Style TransferBenchmark of 52 prompts and 20 style images 1.0 (test)
Text Alignment0.235
8
Style TransferUser Study 60 questions derived from 20 diverse pairs 1.0
Text Align0.153
7
Showing 3 of 3 rows

Other info

Code

Follow for update