R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
About
In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images. The framework is based on Faster R-CNN [1] architecture. First, we use the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose the texts with different orientations. Second, for each axis-aligned text box proposed by RPN, we extract its pooled features with different pooled sizes and the concatenated features are used to simultaneously predict the text/non-text score, axis-aligned box and inclined minimum area box. At last, we use an inclined non-maximum suppression to get the detection results. Our approach achieves competitive results on text detection benchmarks: ICDAR 2015 and ICDAR 2013.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Oriented Object Detection | DOTA v1.0 (test) | SV59.92 | 378 | |
| Object Detection | DOTA 1.0 (test) | Plane AP80.94 | 256 | |
| Text Detection | ICDAR 2015 | Precision85.62 | 171 | |
| Scene Text Detection | ICDAR 2015 (test) | F1 Score82.54 | 150 | |
| Oriented Text Detection | ICDAR 2015 (test) | Precision85.6 | 129 | |
| Text Detection | ICDAR 2015 (test) | F1 Score82.54 | 108 | |
| Oriented Object Detection | DOTA (test) | AP (Plane)88.52 | 92 | |
| Text Detection | ICDAR 2013 (test) | F1 Score79.68 | 88 | |
| Object Detection | HRSC 2016 (test) | mAP@0.0773.07 | 72 | |
| Oriented Object Detection | HRSC 2016 (test) | mAP73.07 | 55 |