Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AON: Towards Arbitrarily-Oriented Text Recognition

About

Recognizing text from natural images is a hot research topic in computer vision due to its various applications. Despite the enduring research of several decades on optical character recognition (OCR), recognizing texts from natural images is still a challenging task. This is because scene texts are often in irregular (e.g. curved, arbitrarily-oriented or seriously distorted) arrangements, which have not yet been well addressed in the literature. Existing methods on text recognition mainly work with regular (horizontal and frontal) texts and cannot be trivially generalized to handle irregular texts. In this paper, we develop the arbitrary orientation network (AON) to directly capture the deep features of irregular texts, which are combined into an attention-based decoder to generate character sequence. The whole network can be trained end-to-end by using only images and word-level annotations. Extensive experiments on various benchmarks, including the CUTE80, SVT-Perspective, IIIT5k, SVT and ICDAR datasets, show that the proposed AON-based method achieves the-state-of-the-art performance in irregular datasets, and is comparable to major existing methods in regular datasets.

Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, Shiliang Pu, Shuigeng Zhou• 2017

Related benchmarks

TaskDatasetResultRank
Scene Text RecognitionSVT (test)
Word Accuracy96
289
Scene Text RecognitionIIIT5K (test)
Word Accuracy99.6
244
Scene Text RecognitionIC15 (test)
Word Accuracy68.2
210
Scene Text RecognitionSVTP (test)
Word Accuracy73
153
Scene Text RecognitionIIIT5K
Accuracy99.6
149
Scene Text RecognitionSVT 647 (test)
Accuracy82.8
101
Scene Text RecognitionCUTE
Accuracy76.8
92
Scene Text RecognitionCUTE80 (test)
Accuracy0.768
87
Scene Text RecognitionIC15
Accuracy68.2
86
Text RecognitionStreet View Text (SVT)
Accuracy96.9
80
Showing 10 of 35 rows

Other info

Follow for update