Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Manifold Alignment for Semantically Aligned Style Transfer

About

Most existing style transfer methods follow the assumption that styles can be represented with global statistics (e.g., Gram matrices or covariance matrices), and thus address the problem by forcing the output and style images to have similar global statistics. An alternative is the assumption of local style patterns, where algorithms are designed to swap similar local features of content and style images. However, the limitation of these existing methods is that they neglect the semantic structure of the content image which may lead to corrupted content structure in the output. In this paper, we make a new assumption that image features from the same semantic region form a manifold and an image with multiple semantic regions follows a multi-manifold distribution. Based on this assumption, the style transfer problem is formulated as aligning two multi-manifold distributions and a Manifold Alignment based Style Transfer (MAST) framework is proposed. The proposed framework allows semantically similar regions between the output and the style image share similar style patterns. Moreover, the proposed manifold alignment method is flexible to allow user editing or using semantic segmentation maps as guidance for style transfer. To allow the method to be applicable to photorealistic style transfer, we propose a new adaptive weight skip connection network structure to preserve the content details. Extensive experiments verify the effectiveness of the proposed framework for both artistic and photorealistic style transfer. Code is available at https://github.com/NJUHuoJing/MAST.

Jing Huo, Shiyin Jin, Wenbin Li, Jing Wu, Yu-Kun Lai, Yinghuan Shi, Yang Gao• 2020

Related benchmarks

TaskDatasetResultRank
Style TransferMS-COCO and WikiArt
Execution Time (s)0.126
48
Artistic Style TransferCOCO and WikiArt (test)
LPIPS0.32
11
Semantic Style Transferquadruple data (val)
SSL1.732
11
Style TransferMS-COCO & WikiArt 512 x 512 images
Average Inference Time (s)0.239
11
Old Photo ModernizationCHD real old photos
NIQE4.8111
7
Old Photo ModernizationCHD (user study)
Top-1 Accuracy0.0568
6
Old Photo ModernizationCHD Synthetic Degraded
PSNR19.0148
5
Showing 7 of 7 rows

Other info

Follow for update