Multi-Oriented Text Detection with Fully Convolutional Networks
About
In this paper, we propose a novel approach for text detec- tion in natural images. Both local and global cues are taken into account for localizing text lines in a coarse-to-fine pro- cedure. First, a Fully Convolutional Network (FCN) model is trained to predict the salient map of text regions in a holistic manner. Then, text line hypotheses are estimated by combining the salient map and character components. Fi- nally, another FCN classifier is used to predict the centroid of each character, in order to remove the false hypotheses. The framework is general for handling text in multiple ori- entations, languages and fonts. The proposed method con- sistently achieves the state-of-the-art performance on three text detection benchmarks: MSRA-TD500, ICDAR2015 and ICDAR2013.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text Detection | ICDAR 2015 | Precision71 | 171 | |
| Scene Text Detection | ICDAR 2015 (test) | F1 Score54 | 150 | |
| Oriented Text Detection | ICDAR 2015 (test) | Precision71 | 129 | |
| Text Detection | ICDAR 2013 (test) | F1 Score83.8 | 88 | |
| Text Detection | MSRA-TD500 | Precision83 | 84 | |
| Text Detection | MSRA-TD500 (test) | Precision83 | 70 | |
| Scene Text Detection | MSRA-TD500 (test) | Precision83 | 65 | |
| Text Detection | ICDAR Incidental Text 2015 (test) | Precision71 | 52 | |
| Text Localization | ICDAR 2013 (test) | Recall78 | 28 | |
| Text Detection | ICDAR Incidental Text 2015 | F-measure54 | 9 |