Text and Style Conditioned GAN for Generation of Offline Handwriting Lines
About
This paper presents a GAN for generating images of handwritten lines conditioned on arbitrary text and latent style vectors. Unlike prior work, which produce stroke points or single-word images, this model generates entire lines of offline handwriting. The model produces variable-sized images by using style vectors to determine character widths. A generator network is trained with GAN and autoencoder techniques to learn style, and uses a pre-trained handwriting recognition network to induce legibility. A study using human evaluators demonstrates that the model produces images that appear to be written by a human. After training, the encoder network can extract a style vector from an image, allowing images in a similar style to be generated, but with arbitrary text.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Handwritten Text Generation | IAM word-level | FID129.6 | 16 | |
| Handwriting generation | IAM (test) | FID20.65 | 9 | |
| Handwriting Synthesis | RIMES line-level | FID109 | 8 | |
| Handwritten Text Generation | RIMES Lines (test) | FID109 | 8 | |
| Line-level Text-to-Image Synthesis | Karaoke Handwritten (test) | FID60.3 | 8 | |
| Styled Text Generation | Karaoke Calligraphy | FID60.3 | 8 | |
| Handwriting Synthesis | IAM Lines | FID44.17 | 8 | |
| Handwriting Synthesis | CVL line-level | FID42.12 | 8 | |
| Handwritten Text Generation | IAM Lines | FID44.17 | 8 | |
| Handwritten Text Generation | CVL Lines (test) | FID42.12 | 8 |