Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Implicit Style-Content Separation using B-LoRA

About

Image stylization involves manipulating the visual appearance and texture (style) of an image while preserving its underlying objects, structures, and concepts (content). The separation of style and content is essential for manipulating the image's style independently from its content, ensuring a harmonious and visually pleasing result. Achieving this separation requires a deep understanding of both the visual and semantic characteristics of images, often necessitating the training of specialized models or employing heavy optimization. In this paper, we introduce B-LoRA, a method that leverages LoRA (Low-Rank Adaptation) to implicitly separate the style and content components of a single image, facilitating various image stylization tasks. By analyzing the architecture of SDXL combined with LoRA, we find that jointly learning the LoRA weights of two specific blocks (referred to as B-LoRAs) achieves style-content separation that cannot be achieved by training each B-LoRA independently. Consolidating the training into only two blocks and separating style and content allows for significantly improving style manipulation and overcoming overfitting issues often associated with model fine-tuning. Once trained, the two B-LoRAs can be used as independent components to allow various image stylization tasks, including image style transfer, text-based image stylization, consistent style generation, and style-content mixing.

Yarden Frenkel, Yael Vinker, Ariel Shamir, Daniel Cohen-Or• 2024

Related benchmarks

TaskDatasetResultRank
Style-Driven Image GenerationSDXL Easy Prompts 1.0 (test)
Content0.293
21
Style-Driven Image GenerationSDXL Complex Prompts 1.0 (test)
Content Score0.316
21
Style-Driven Image GenerationSDXL Easy + Complex Averaged 1.0 (test)
Content Score0.289
7
Personalized Image GenerationUser Study 50 samples 1.0 (test)
Content Fidelity70
6
Personalized Image Generation10 distinct content-style pairs
Content Similarity (CLIP-I)0.74
6
Subject-Style LoRA FusionDreamBooth
Style Similarity58
5
Subject and style fusion30 unique content-style pairs (StyleDrop & Subject datasets) SDXL v1.0 based (test)
User Preference Score21.89
4
Qualitative User Preference EvaluationUser Study A/B B-LoRA 1.0 (test)
Observed Votes57
2
Showing 8 of 8 rows

Other info

Follow for update