SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

About

Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict & Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without any prior segmentation stage. The framework is as simple as the one used for the recognition of isolated lines and we achieve competitive results on three popular datasets: RIMES, IAM and READ 2016. The proposed model does not require any dataset adaptation, it can be trained from scratch, without segmentation labels, and it does not require line breaks in the transcription labels. Our code and trained model weights are available at https://github.com/FactoDeepLearning/SPAN.

Denis Coquenet, Cl\'ement Chatelain, Thierry Paquet• 2021

Related benchmarks

Task	Dataset	Result
Handwritten text recognition	IAM (test)	CER5.45	102
Handwritten text recognition	RIMES	Character Error Rate (CER)3.81	26
Handwritten text recognition	READ 2016 (test)	CER4.6	23
Handwritten text recognition	IAM	Character Error Rate (CER)5.45	12
Handwritten text recognition	IAM (val)	CER3.57	9
Handwriting Recognition	RIMES (val)	CER3.56	8
Handwritten text recognition	RIMES line level (test)	CER3.81	5
Paragraph Recognition	RIMES (test)	CER4.17	4
Paragraph-level Handwriting Recognition	RIMES 2011 (test)	CER4.17	4
Handwritten Document Recognition	READ Paragraph level 2016 (test)	CER6.2	3

Showing 10 of 12 rows

Other info

Code

Follow for update

@wizwand_team Discord