Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection

About

Arbitrary shape text detection is a challenging task due to the high variety and complexity of scenes texts. In this paper, we propose a novel unified relational reasoning graph network for arbitrary shape text detection. In our method, an innovative local graph bridges a text proposal model via Convolutional Neural Network (CNN) and a deep relational reasoning network via Graph Convolutional Network (GCN), making our network end-to-end trainable. To be concrete, every text instance will be divided into a series of small rectangular components, and the geometry attributes (e.g., height, width, and orientation) of the small components will be estimated by our text proposal model. Given the geometry attributes, the local graph construction model can roughly establish linkages between different text components. For further reasoning and deducing the likelihood of linkages between the component and its neighbors, we adopt a graph-based network to perform deep relational reasoning on local graphs. Experiments on public available datasets demonstrate the state-of-the-art performance of our method.

Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chang Liu, Chun Yang, Hongfa Wang, Xu-Cheng Yin• 2020

Related benchmarks

TaskDatasetResultRank
Text DetectionICDAR 2015
Precision88.5
171
Text DetectionCTW1500 (test)
Precision85.9
157
Scene Text DetectionICDAR 2015 (test)
F1 Score86.6
150
Text DetectionTotal-Text
Recall84.9
139
Oriented Text DetectionICDAR 2015 (test)
Precision88.5
129
Text DetectionTotal-Text (test)
F-Measure85.7
126
Text DetectionICDAR 2015 (test)
F1 Score86.6
108
Scene Text DetectionTotalText (test)
Recall84.9
106
Text DetectionMSRA-TD500
Precision88.1
84
Text DetectionMSRA-TD500 (test)
Precision88.1
70
Showing 10 of 19 rows

Other info

Follow for update