Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PixelLink: Detecting Scene Text via Instance Segmentation

About

Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a kind of semantic segmentation that contains full location information in itself. However, text instances in scene images often lie very close to each other, making them very difficult to separate via semantic segmentation. Therefore, instance segmentation is needed to address this problem. In this paper, PixelLink, a novel scene text detection algorithm based on instance segmentation, is proposed. Text instances are first segmented out by linking pixels within the same instance together. Text bounding boxes are then extracted directly from the segmentation result without location regression. Experiments show that, compared with regression-based methods, PixelLink can achieve better or comparable performance on several benchmarks, while requiring many fewer training iterations and less training data.

Dan Deng, Haifeng Liu, Xuelong Li, Deng Cai• 2018

Related benchmarks

TaskDatasetResultRank
Text DetectionICDAR 2015
Precision85.5
171
Scene Text DetectionICDAR 2015 (test)
F1 Score83.7
150
Oriented Text DetectionICDAR 2015 (test)
Precision85.5
129
Text DetectionICDAR 2013 (test)
F1 Score88.1
88
Text DetectionMSRA-TD500
Precision83
84
Text DetectionMSRA-TD500 (test)
Precision83
70
Scene Text DetectionMSRA-TD500 (test)
Precision83
65
Scene Text DetectionMSRA-TD500
Precision83
28
Text DetectionDAST1500
Precision0.745
7
Showing 9 of 9 rows

Other info

Follow for update