Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
About
Gatys et al. recently introduced a neural algorithm that renders a content image in the style of another image, achieving so-called style transfer. However, their framework requires a slow iterative optimization process, which limits its practical application. Fast approximations with feed-forward neural networks have been proposed to speed up neural style transfer. Unfortunately, the speed improvement comes at a cost: the network is usually tied to a fixed set of styles and cannot adapt to arbitrary new styles. In this paper, we present a simple yet effective approach that for the first time enables arbitrary style transfer in real-time. At the heart of our method is a novel adaptive instance normalization (AdaIN) layer that aligns the mean and variance of the content features with those of the style features. Our method achieves speed comparable to the fastest existing approach, without the restriction to a pre-defined set of styles. In addition, our approach allows flexible user controls such as content-style trade-off, style interpolation, color & spatial controls, all using a single feed-forward neural network.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Style Transfer | MS-COCO and WikiArt | Execution Time (s)0.004 | 48 | |
| Image Enhancement | SICE Underexposure v2 | PSNR22.06 | 34 | |
| Style Transfer | MS-COCO (content) + WikiArt (style) (test) | LPIPS0.56 | 31 | |
| Image Style Transfer | User Study | Overall Quality Score23.5 | 30 | |
| Camouflaged Image Synthesis | LAKE-RED Camouflaged Objects | KL_BF0.8821 | 28 | |
| Image Style Transfer | (test) | Average Inference Time (s)0.007 | 22 | |
| Image Enhancement | SICE Overexposure v2 | PSNR19.5 | 17 | |
| Style Transfer | MS-COCO (content) + WikiArt (style) (test) | Lcont4.88 | 17 | |
| Camouflaged Image Synthesis | LAKE-RED (Overall) | KL_BF1.114 | 14 | |
| Camouflaged Image Synthesis | LAKE-RED Salient Objects | KL_BF1.3065 | 14 |