Realtime Data-Efficient Portrait Stylization Based On Geometric Alignment
About
Portrait Stylization aims to imbue portrait photos with vivid artistic effects drawn from style examples. Despite the availability of enormous training datasets and large network weights, existing methods struggle to maintain geometric consistency and achieve satisfactory stylization effects due to the disparity in facial feature distributions between facial photographs and stylized images, limiting the application on rare styles and mobile devices. To alleviate this, we propose to establish meaningful geometric correlations between portraits and style samples to simplify the stylization by aligning corresponding facial characteristics. Specifically, we integrate differentiable Thin-Plate-Spline (TPS) modules into an end-to-end Generative Adversarial Network (GAN) framework to improve the training efficiency and promote the consistency of facial identities. By leveraging inherent structural information of faces, e.g., facial landmarks, TPS module can establish geometric alignments between the two domains, at global and local scales, both in pixel and feature spaces, thereby overcoming the aforementioned challenges. Quantitative and qualitative comparisons on a range of portrait stylization tasks demonstrate that our models not only outperforms existing models in terms of fidelity and stylistic consistency, but also achieves remarkable improvements in 2x training data efficiency and 100x less computational complexity, allowing our lightweight model to achieve real-time inference (30 FPS) at 512*512 resolution on mobile devices.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Portrait Stylization | Animation | FID40.48 | 10 | |
| Portrait Stylization | Watercolor | FID132.9 | 10 | |
| Portrait Stylization | Oilpaint | FID77.74 | 10 | |
| Portrait Stylization | Inkpaint | FID100.4 | 10 | |
| Portrait Stylization | Portrait Stylization Ryzen 6900 CPU | Inference Time (ms)62.5 | 6 | |
| Portrait Stylization | Portrait Stylization | Inference Time (ms)12.4 | 6 | |
| Portrait Stylization | Portrait Stylization Snapdragon 8Gen1 SoC | Inference Time (ms)31.8 | 6 | |
| Portrait Stylization | Portrait Stylization datasets | -- | 1 |