Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CSGO: Content-Style Composition in Text-to-Image Generation

About

The diffusion model has shown exceptional capabilities in controlled image generation, which has further fueled interest in image style transfer. Existing works mainly focus on training free-based methods (e.g., image inversion) due to the scarcity of specific data. In this study, we present a data construction pipeline for content-style-stylized image triplets that generates and automatically cleanses stylized data triplets. Based on this pipeline, we construct a dataset IMAGStyle, the first large-scale style transfer dataset containing 210k image triplets, available for the community to explore and research. Equipped with IMAGStyle, we propose CSGO, a style transfer model based on end-to-end training, which explicitly decouples content and style features employing independent feature injection. The unified CSGO implements image-driven style transfer, text-driven stylized synthesis, and text editing-driven stylized synthesis. Extensive experiments demonstrate the effectiveness of our approach in enhancing style control capabilities in image generation. Additional visualization and access to the source code can be located on the project page: \url{https://csgo-gen.github.io/}.

Peng Xing, Haofan Wang, Yanpeng Sun, Qixun Wang, Xu Bai, Hao Ai, Renyuan Huang, Zechao Li• 2024

Related benchmarks

TaskDatasetResultRank
Image Style TransferUser Study
Overall Quality Score57
30
Style TransferMS-COCO and WikiArt 1,000 images each
ArtFID27.116
11
Image Style TransferStyle Transfer 750 images (test)
Style Score0.5224
10
Stylized GenerationStyleBench
CLIP TA0.223
9
Style TransferCIFAR-100 and InstaStyle (test)
Content Score27.7
9
Image StylizationCustom Triplet Dataset 21 styles (test)
CLIP Score63.41
9
Style TransferStyle-Content Pairs 50 style x 40 content references (test)
CSD Score0.535
8
Text-driven Style TransferBenchmark of 52 prompts and 20 style images 1.0 (test)
Text Alignment0.216
8
Style TransferBCS-Bench
DINO0.6022
8
Style TransferStyle Transfer Evaluation Set (test)
Style Score55.02
8
Showing 10 of 14 rows

Other info

Follow for update