Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PhotoWCT$^2$: Compact Autoencoder for Photorealistic Style Transfer Resulting from Blockwise Training and Skip Connections of High-Frequency Residuals

About

Photorealistic style transfer is an image editing task with the goal to modify an image to match the style of another image while ensuring the result looks like a real photograph. A limitation of existing models is that they have many parameters, which in turn prevents their use for larger image resolutions and leads to slower run-times. We introduce two mechanisms that enable our design of a more compact model that we call PhotoWCT$^2$, which preserves state-of-art stylization strength and photorealism. First, we introduce blockwise training to perform coarse-to-fine feature transformations that enable state-of-art stylization strength in a single autoencoder in place of the inefficient cascade of four autoencoders used in PhotoWCT. Second, we introduce skip connections of high-frequency residuals in order to preserve image quality when applying the sequential coarse-to-fine feature transformations. Our PhotoWCT$^2$ model requires fewer parameters (e.g., 30.3\% fewer) while supporting higher resolution images (e.g., 4K) and achieving faster stylization than existing models.

Tai-Yin Chiu, Danna Gurari• 2021

Related benchmarks

TaskDatasetResultRank
Photorealistic Style TransferNovel content-style pairs
Runtime (s)0.3
39
Image Color Style TransferLuan et al. (test)
CPU Inference Time (s)3.111
15
Color-Conditional Image GenerationContraStyles + Unsplash Lite (test)
2-Wasserstein Distance0.1028
13
Text-to-image generation conditioned on a reference color distributionSD generations Unconditional 1.5
2-Wasserstein Distance0.1085
12
Low-light Image EnhancementLOL and SICE Averaged v1, v2 (test)
Content Similarity0.857
9
Tone manipulationMIT Adobe FiveK
Content Sim.78.7
9
Photorealistic Style TransferPhotorealistic Style Transfer Evaluation Set (N=7000) (test)
SQA36.7
8
Color Style Transfer20 image sets (val)
Average Ranking4.11
7
Color Style TransferFHD 1920 x 1080
Inference Time (s)0.291
6
Photorealistic Style TransferUser Study
Win Rate (H2S)61.62
6
Showing 10 of 13 rows

Other info

Follow for update