Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

About

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images. The framework is based on Faster R-CNN [1] architecture. First, we use the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose the texts with different orientations. Second, for each axis-aligned text box proposed by RPN, we extract its pooled features with different pooled sizes and the concatenated features are used to simultaneously predict the text/non-text score, axis-aligned box and inclined minimum area box. At last, we use an inclined non-maximum suppression to get the detection results. Our approach achieves competitive results on text detection benchmarks: ICDAR 2015 and ICDAR 2013.

Yingying Jiang, Xiangyu Zhu, Xiaobing Wang, Shuli Yang, Wei Li, Hua Wang, Pei Fu, Zhenbo Luo• 2017

Related benchmarks

TaskDatasetResultRank
Oriented Object DetectionDOTA v1.0 (test)
SV59.92
378
Object DetectionDOTA 1.0 (test)
Plane AP80.94
256
Text DetectionICDAR 2015
Precision85.62
171
Scene Text DetectionICDAR 2015 (test)
F1 Score82.54
150
Oriented Text DetectionICDAR 2015 (test)
Precision85.6
129
Text DetectionICDAR 2015 (test)
F1 Score82.54
108
Oriented Object DetectionDOTA (test)
AP (Plane)88.52
92
Text DetectionICDAR 2013 (test)
F1 Score79.68
88
Object DetectionHRSC 2016 (test)
mAP@0.0773.07
72
Oriented Object DetectionHRSC 2016 (test)
mAP73.07
55
Showing 10 of 16 rows

Other info

Follow for update