Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

2D Attentional Irregular Scene Text Recognizer

About

Irregular scene text, which has complex layout in 2D space, is challenging to most previous scene text recognizers. Recently, some irregular scene text recognizers either rectify the irregular text to regular text image with approximate 1D layout or transform the 2D image feature map to 1D feature sequence. Though these methods have achieved good performance, the robustness and accuracy are still limited due to the loss of spatial information in the process of 2D to 1D transformation. Different from all of previous, we in this paper propose a framework which transforms the irregular text with 2D layout to character sequence directly via 2D attentional scheme. We utilize a relation attention module to capture the dependencies of feature maps and a parallel attention module to decode all characters in parallel, which make our method more effective and efficient. Extensive experiments on several public benchmarks as well as our collected multi-line text dataset show that our approach is effective to recognize regular and irregular scene text and outperforms previous methods both in accuracy and speed.

Pengyuan Lyu, Zhicheng Yang, Xinhang Leng, Xiaojun Wu, Ruiyu Li, Xiaoyong Shen• 2019

Related benchmarks

TaskDatasetResultRank
Scene Text RecognitionSVT (test)
Word Accuracy90.1
289
Scene Text RecognitionIIIT5K (test)
Word Accuracy94
244
Scene Text RecognitionIC15 (test)
Word Accuracy76.3
210
Scene Text RecognitionIC13 (test)
Word Accuracy92.7
207
Scene Text RecognitionSVTP (test)
Word Accuracy82.3
153
Scene Text RecognitionCUTE
Accuracy86.8
92
Scene Text RecognitionIC15
Accuracy76.3
86
Scene Text RecognitionSVT
Accuracy90.1
67
Scene Text RecognitionIC13
Accuracy92.7
66
Scene Text RecognitionCUTE (test)
Accuracy86.8
59
Showing 10 of 13 rows

Other info

Follow for update