Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VATr++: Choose Your Words Wisely for Handwritten Text Generation

About

Styled Handwritten Text Generation (HTG) has received significant attention in recent years, propelled by the success of learning-based solutions employing GANs, Transformers, and, preliminarily, Diffusion Models. Despite this surge in interest, there remains a critical yet understudied aspect - the impact of the input, both visual and textual, on the HTG model training and its subsequent influence on performance. This study delves deeper into a cutting-edge Styled-HTG approach, proposing strategies for input preparation and training regularization that allow the model to achieve better performance and generalize better. These aspects are validated through extensive analysis on several different settings and datasets. Moreover, in this work, we go beyond performance optimization and address a significant hurdle in HTG research - the lack of a standardized evaluation protocol. In particular, we propose a standardization of the evaluation protocol for HTG and conduct a comprehensive benchmarking of existing approaches. By doing so, we aim to establish a foundation for fair and meaningful comparisons between HTG strategies, fostering progress in the field.

Bram Vanherle, Vittorio Pippi, Silvia Cascianelli, Nick Michiels, Frank Van Reeth, Rita Cucchiara• 2024

Related benchmarks

TaskDatasetResultRank
Handwritten Text GenerationIAM word-level
FID31.91
16
Handwriting SynthesisIAM Lines
FID34
8
Handwritten Text GenerationIAM Lines
FID34
8
Handwriting SynthesisCVL line-level
FID35.53
8
Handwriting SynthesisRIMES line-level
FID110
8
Handwritten Text GenerationCVL Lines (test)
FID35.53
8
Handwritten Text GenerationRIMES Lines (test)
FID110
8
Line-level Text-to-Image SynthesisKaraoke Typewritten (test)
FID76.03
8
Styled Text GenerationKaraoke (Typewritten)
FID76.03
8
Line-level Text-to-Image SynthesisKaraoke Handwritten (test)
FID67.16
8
Showing 10 of 11 rows

Other info

Follow for update