R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

About

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images. The framework is based on Faster R-CNN [1] architecture. First, we use the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose the texts with different orientations. Second, for each axis-aligned text box proposed by RPN, we extract its pooled features with different pooled sizes and the concatenated features are used to simultaneously predict the text/non-text score, axis-aligned box and inclined minimum area box. At last, we use an inclined non-maximum suppression to get the detection results. Our approach achieves competitive results on text detection benchmarks: ICDAR 2015 and ICDAR 2013.

Yingying Jiang, Xiangyu Zhu, Xiaobing Wang, Shuli Yang, Wei Li, Hua Wang, Pei Fu, Zhenbo Luo• 2017

Related benchmarks

Task	Dataset	Result
Oriented Object Detection	DOTA v1.0 (test)	SV59.92	395
Object Detection	DOTA 1.0 (test)	Plane AP80.94	256
Text Detection	ICDAR 2015	Precision85.62	188
Scene Text Detection	ICDAR 2015 (test)	F1 Score82.54	150
Oriented Text Detection	ICDAR 2015 (test)	Precision85.6	129
Text Detection	ICDAR 2015 (test)	F1 Score82.54	108
Oriented Object Detection	DOTA (test)	AP (Plane)88.52	92
Text Detection	ICDAR 2013 (test)	F1 Score79.68	88
Object Detection	HRSC 2016 (test)	mAP@0.0773.07	72
Oriented Object Detection	HRSC 2016 (test)	mAP73.07	55

Showing 10 of 16 rows

Other info

Follow for update

@wizwand_team Discord