Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes

About

Previous scene text detection methods have progressed substantially over the past years. However, limited by the receptive field of CNNs and the simple representations like rectangle bounding box or quadrangle adopted to describe text, previous methods may fall short when dealing with more challenging text instances, such as extremely long text and arbitrarily shaped text. To address these two problems, we present a novel text detector namely LOMO, which localizes the text progressively for multiple times (or in other word, LOok More than Once). LOMO consists of a direct regressor (DR), an iterative refinement module (IRM) and a shape expression module (SEM). At first, text proposals in the form of quadrangle are generated by DR branch. Next, IRM progressively perceives the entire long text by iterative refinement based on the extracted feature blocks of preliminary proposals. Finally, a SEM is introduced to reconstruct more precise representation of irregular text by considering the geometry properties of text instance, including text region, text center line and border offsets. The state-of-the-art results on several public benchmarks including ICDAR2017-RCTW, SCUT-CTW1500, Total-Text, ICDAR2015 and ICDAR17-MLT confirm the striking robustness and effectiveness of LOMO.

Chengquan Zhang, Borong Liang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding, Xinghao Ding• 2019

Related benchmarks

TaskDatasetResultRank
Text DetectionICDAR 2015
Precision91.3
171
Text DetectionCTW1500 (test)
Precision89.2
157
Scene Text DetectionICDAR 2015 (test)
F1 Score87.2
150
Text DetectionTotal-Text
Recall79.3
139
Oriented Text DetectionICDAR 2015 (test)
Precision91.3
129
Text DetectionTotal-Text (test)
F-Measure83.3
126
Text DetectionICDAR 2015 (test)
F1 Score87.2
108
Scene Text DetectionTotalText (test)
Recall79.3
106
Text DetectionCTW1500
F-measure80.8
70
Scene Text DetectionTotal-Text
Precision87.6
63
Showing 10 of 16 rows

Other info

Follow for update