DG-Font: Deformable Generative Networks for Unsupervised Font Generation
About
Font generation is a challenging problem especially for some writing systems that consist of a large number of characters and has attracted a lot of attention in recent years. However, existing methods for font generation are often in supervised learning. They require a large number of paired data, which is labor-intensive and expensive to collect. Besides, common image-to-image translation models often define style as the set of textures and colors, which cannot be directly applied to font generation. To address these problems, we propose novel deformable generative networks for unsupervised font generation (DGFont). We introduce a feature deformation skip connection (FDSC) which predicts pairs of displacement maps and employs the predicted maps to apply deformable convolution to the low-level feature maps from the content encoder. The outputs of FDSC are fed into a mixer to generate the final results. Taking advantage of FDSC, the mixer outputs a high-quality character with a complete structure. To further improve the quality of generated images, we use three deformable convolution layers in the content encoder to learn style-invariant feature representations. Experiments demonstrate that our model generates characters in higher quality than state-of-art methods. The source code is available at https://github.com/ecnuycxie/DG-Font.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Font Generation | Collected Chinese font dataset Unseen styles and Unseen contents | SSIM0.7553 | 13 | |
| Vision-only Few-shot Font Generation | Chinese font dataset Small (UFSC) | RMSE0.3208 | 10 | |
| Vision-only Few-shot Font Generation | Chinese font dataset Large (UFSC) | RMSE0.3117 | 10 | |
| Handwriting generation | UWSC (Unseen Writer Same Content) Traditional Chinese 1.0 | Content Score (CS)8.63 | 10 | |
| Handwriting generation | UWUC (Unseen Writer Unseen Content) Traditional Chinese 1.0 | Content Score (CS)10.86 | 10 | |
| Vision-only Few-shot Font Generation | Chinese font dataset Small (UFUC) | RMSE0.3173 | 9 | |
| Vision-only Few-shot Font Generation | Chinese font dataset Large (UFUC) | RMSE0.3074 | 9 | |
| Font Generation | Collected Chinese font dataset (Seen styles and Unseen contents) | SSIM0.7697 | 8 | |
| Font Generation | Chinese Font Dataset (Seen Fonts) | L1 Error0.0625 | 7 | |
| Font Generation | UFSC Unseen Fonts Seen Characters | L1 Loss0.131 | 6 |