
Handwritten Text Generation from Visual Archetypes

About

Generating synthetic images of handwritten text in a writer-specific style is a challenging task, especially for unseen styles and new words, and even more so when the new words contain characters that are rarely encountered during training. While emulating a writer's style has recently been addressed by generative models, generalization to rare characters has been disregarded. In this work, we devise a Transformer-based model for few-shot styled handwritten text generation and focus on obtaining a robust and informative representation of both the text and the style. In particular, we propose a novel representation of the textual content as a sequence of dense vectors obtained from images of symbols written as standard GNU Unifont glyphs, which can be considered their visual archetypes. This strategy is better suited to generating characters that, despite being rarely seen during training, may share visual details with frequently observed ones. As for the style, we obtain a robust representation of unseen writers' calligraphy by exploiting specific pre-training on a large synthetic dataset. Quantitative and qualitative results demonstrate that our proposal generates words in unseen styles and with rare characters more faithfully than existing approaches that rely on independent one-hot encodings of the characters.
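
The contrast between one-hot encodings and glyph-based "visual archetypes" can be illustrated with a minimal sketch. It is not the authors' implementation: the 5x5 bitmaps in `GLYPHS` are hypothetical stand-ins for GNU Unifont glyphs, the helper names (`one_hot`, `glyph_vector`, `encode_text`, `cosine`) are assumptions made for illustration, and the learned mapping from glyph images to dense vectors is omitted. The sketch only shows that flattened glyph images let visually similar characters share information, whereas one-hot vectors of distinct characters never do.

```python
# Hypothetical 5x5 bitmaps standing in for GNU Unifont glyphs (illustration only).
GLYPHS = {
    "O": ["01110", "10001", "10001", "10001", "01110"],
    "Q": ["01110", "10001", "10001", "10011", "01111"],
    "I": ["01110", "00100", "00100", "00100", "01110"],
}
CHARSET = sorted(GLYPHS)


def one_hot(ch, charset=CHARSET):
    """Independent one-hot encoding: distinct characters share no components."""
    return [1.0 if c == ch else 0.0 for c in charset]


def glyph_vector(ch):
    """Flatten the glyph bitmap row by row into a vector of pixel values."""
    return [float(px) for row in GLYPHS[ch] for px in row]


def encode_text(text):
    """Represent a word as a sequence of flattened glyph vectors, one per character."""
    return [glyph_vector(ch) for ch in text]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sum(a * a for a in u) ** 0.5
    norm_v = sum(b * b for b in v) ** 0.5
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    print("one-hot O vs Q:", cosine(one_hot("O"), one_hot("Q")))
    print("glyph   O vs Q:", round(cosine(glyph_vector("O"), glyph_vector("Q")), 3))
    print("glyph   O vs I:", round(cosine(glyph_vector("O"), glyph_vector("I")), 3))
```

In this toy setting, a rare "Q" overlaps heavily with a frequent "O" in glyph space, which is the kind of shared visual detail the abstract refers to; under one-hot encoding the two characters are orthogonal.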

Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara • 2023

Related benchmarks

Task                                Dataset                     Result      Rank
Handwritten Text Generation         IAM word-level              FID 30.26   16
Handwriting generation              IAM (test)                  FID 17.79   9
Handwriting Synthesis               CVL line-level              FID 34.4    8
Handwritten Text Generation         CVL Lines (test)            FID 34.4    8
Handwriting Synthesis               IAM Lines                   FID 35.32   8
Handwritten Text Generation         IAM Lines                   FID 35.32   8
Handwriting Synthesis               RIMES line-level            FID 113.8   8
Handwritten Text Generation         RIMES Lines (test)          FID 113.8   8
Line-level Text-to-Image Synthesis  Karaoke Typewritten (test)  FID 80.38   8
Styled Text Generation              Karaoke (Typewritten)       FID 80.38   8
Showing 10 of 18 rows
