Inversion-Based Style Transfer with Diffusion Models

About

The artistic style within a painting is the means of expression, which includes not only the painting material, colors, and brushstrokes, but also the high-level attributes including semantic elements, object shapes, etc. Previous arbitrary example-guided artistic image generation methods often fail to control shape changes or convey elements. The pre-trained text-to-image synthesis diffusion probabilistic models have achieved remarkable quality, but it often requires extensive textual descriptions to accurately portray attributes of a particular painting. We believe that the uniqueness of an artwork lies precisely in the fact that it cannot be adequately explained with normal language. Our key idea is to learn artistic style directly from a single painting and then guide the synthesis without providing complex textual descriptions. Specifically, we assume style as a learnable textual description of a painting. We propose an inversion-based style transfer method (InST), which can efficiently and accurately learn the key information of an image, thus capturing and transferring the artistic style of a painting. We demonstrate the quality and efficiency of our method on numerous paintings of various artists and styles. Code and models are available at https://github.com/zyxElsa/InST.

Yuxin Zhang, Nisha Huang, Fan Tang, Haibin Huang, Chongyang Ma, Weiming Dong, Changsheng Xu• 2022

Related benchmarks

Task	Dataset	Result
Style Transfer	ArtFID Benchmark (test)	ArtFID23.869	45
Style Transfer	MS-COCO (content) + WikiArt (style) (test)	LPIPS0.785	45
Image Style Transfer	User Study	Overall Quality Score83.9	30
Artistic Style Transfer	MS-COCO content images and WikiArt style images 512x512 resolution (test)	FID (Artistic Style)40.633	13
Photo-realistic transfer	MSCOCO	FID (Style)23.048	11
Affective Image Stylization	EmoEdit (inference)	CLIP Score0.569	11
Artistic transfer	WikiArt	FID (Style)21.571	11
Multi-style Image Transfer	MS-COCO (content) & WikiArt (style) Two-style setting Stable Diffusion v1.4 backbone (test)	ArtFID25.152	9
Style Transfer	User Study 10 content images, 8 style images (test)	Style Score0.4	9
Style Transfer	Single image on A100 GPU (test)	Inference Time (s)5	7

Showing 10 of 20 rows

Other info

Code

Follow for update

@wizwand_team Discord