Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OmniStyle: Filtering High Quality Style Transfer Data at Scale

About

In this paper, we introduce OmniStyle-1M, a large-scale paired style transfer dataset comprising over one million content-style-stylized image triplets across 1,000 diverse style categories, each enhanced with textual descriptions and instruction prompts. We show that OmniStyle-1M can not only enable efficient and scalable of style transfer models through supervised training but also facilitate precise control over target stylization. Especially, to ensure the quality of the dataset, we introduce OmniFilter, a comprehensive style transfer quality assessment framework, which filters high-quality triplets based on content preservation, style consistency, and aesthetic appeal. Building upon this foundation, we propose OmniStyle, a framework based on the Diffusion Transformer (DiT) architecture designed for high-quality and efficient style transfer. This framework supports both instruction-guided and image-guided style transfer, generating high resolution outputs with exceptional detail. Extensive qualitative and quantitative evaluations demonstrate OmniStyle's superior performance compared to existing approaches, highlighting its efficiency and versatility. OmniStyle-1M and its accompanying methodologies provide a significant contribution to advancing high-quality style transfer, offering a valuable resource for the research community.

Ye Wang, Ruiqi Liu, Jiang Lin, Fei Liu, Zili Yi, Yilin Wang, Rui Ma• 2025

Related benchmarks

TaskDatasetResultRank
reference-guided style transferOmniConsistency-Bench
FID130.4
20
Style TransferCSG-Bench
FID129.8
20
Affective Image StylizationEmoEdit (inference)
CLIP Score0.71
11
Image Editingour dataset film-grey style
PSNR15.91
11
Image Editingfilm-dream-blue style
PSNR12.55
11
Style EditingStyle Editing Dataset isp style
PSNR14.14
11
Style EditingStyleQoRA lomo style (test)
PSNR11.11
11
Controllable Style GenerationCSG-Bench Text-guided
Content Preference Rate1.8
9
Controllable Style GenerationCSG-Bench Reference-guided
Content Preference Rate2
9
Image StylizationCustom Triplet Dataset 21 styles (test)
CLIP Score65.39
9
Showing 10 of 16 rows

Other info

Follow for update