Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MSR: Multi-Scale Shape Regression for Scene Text Detection

About

State-of-the-art scene text detection techniques predict quadrilateral boxes that are prone to localization errors while dealing with straight or curved text lines of different orientations and lengths in scenes. This paper presents a novel multi-scale shape regression network (MSR) that is capable of locating text lines of different lengths, shapes and curvatures in scenes. The proposed MSR detects scene texts by predicting dense text boundary points that inherently capture the location and shape of text lines accurately and are also more tolerant to the variation of text line length as compared with the state of the arts using proposals or segmentation. Additionally, the multi-scale network extracts and fuses features at different scales which demonstrates superb tolerance to the text scale variation. Extensive experiments over several public datasets show that the proposed MSR obtains superior detection performance for both curved and straight text lines of different lengths and orientations.

Chuhui Xue, Shijian Lu, Wei Zhang• 2019

Related benchmarks

TaskDatasetResultRank
Text DetectionCTW1500 (test)
Precision85
157
Text DetectionTotal-Text
Recall73
139
Oriented Text DetectionICDAR 2015 (test)
Precision86.6
129
Text DetectionTotal-Text (test)
F-Measure79
126
Text DetectionICDAR 2015 (test)
F1 Score82.3
108
Scene Text DetectionTotalText (test)
Recall74.8
106
Text DetectionMSRA-TD500
Precision87.4
84
Text DetectionMSRA-TD500 (test)
Precision87.4
70
Text DetectionCTW1500--
70
Scene Text DetectionMSRA-TD500 (test)
Precision87.4
65
Showing 10 of 13 rows

Other info

Follow for update