Photorealistic Style Transfer via Wavelet Transforms
About
Recent style transfer models have provided promising artistic results. However, given a photograph as a reference style, existing methods are limited by spatial distortions or unrealistic artifacts, which should not happen in real photographs. We introduce a theoretically sound correction to the network architecture that remarkably enhances photorealism and faithfully transfers the style. The key ingredient of our method is wavelet transforms that naturally fits in deep networks. We propose a wavelet corrected transfer based on whitening and coloring transforms (WCT$^2$) that allows features to preserve their structural information and statistical properties of VGG feature space during stylization. This is the first and the only end-to-end model that can stylize a $1024\times1024$ resolution image in 4.7 seconds, giving a pleasing and photorealistic quality without any post-processing. Last but not least, our model provides a stable video stylization without temporal constraints. Our code, generated images, and pre-trained models are all available at https://github.com/ClovaAI/WCT2.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Photorealistic Style Transfer | Novel content-style pairs | Runtime (s)0.04 | 39 | |
| Image Color Style Transfer | Luan et al. (test) | CPU Inference Time (s)24.204 | 15 | |
| Style Transfer | PST50 | CP Count68.18 | 15 | |
| Tone Style Transfer | PST50 | CP0.6482 | 15 | |
| Style Transfer | TST2K | CP Count41.16 | 15 | |
| Tone Style Transfer | TST2K | PSNR19.16 | 14 | |
| Color-Conditional Image Generation | ContraStyles + Unsplash Lite (test) | 2-Wasserstein Distance0.1347 | 13 | |
| Text-to-image generation conditioned on a reference color distribution | SD generations Unconditional 1.5 | 2-Wasserstein Distance0.1425 | 12 | |
| Photorealistic Style Transfer | Photorealistic Style Transfer Evaluation Set (N=7000) (test) | SQA16.1 | 8 | |
| Color Style Transfer | 20 image sets (val) | Average Ranking2.67 | 7 |