ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network

About

Scene text detection and recognition have received increasing research attention. Existing methods can be roughly categorized into two groups: character-based and segmentation-based. These methods either require costly character-level annotation or maintain a complex pipeline, which often makes them unsuitable for real-time applications. Here we address the problem by proposing the Adaptive Bezier-Curve Network (ABCNet). Our contributions are three-fold: 1) For the first time, we adaptively fit arbitrarily-shaped text by a parameterized Bezier curve. 2) We design a novel BezierAlign layer for extracting accurate convolution features of an arbitrarily-shaped text instance, significantly improving the precision compared with previous methods. 3) Compared with standard bounding box detection, our Bezier curve detection introduces negligible computation overhead, making our method superior in both efficiency and accuracy. Experiments on arbitrarily-shaped benchmark datasets, namely Total-Text and CTW1500, demonstrate that ABCNet achieves state-of-the-art accuracy while significantly improving speed. In particular, on Total-Text, our real-time version is over 10 times faster than recent state-of-the-art methods with competitive recognition accuracy. Code is available at https://tinyurl.com/AdelaiDet

Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei Wang • 2020
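
To make the first contribution in the abstract concrete, the sketch below evaluates a Bezier curve with the Bernstein basis and fits control points to an ordered set of boundary points by least squares. It is only an illustration under stated assumptions: the abstract does not give the curve degree, so a cubic (four control points) is assumed; the boundary points are assumed to be spaced uniformly in the curve parameter; and the names bernstein, bezier_point, solve_linear, and fit_bezier are hypothetical, not taken from the AdelaiDet code.

```python
from math import comb


def bernstein(i, n, t):
    """Bernstein basis polynomial B_{i,n}(t) for t in [0, 1]."""
    return comb(n, i) * t**i * (1 - t) ** (n - i)


def bezier_point(ctrl, t):
    """Evaluate the Bezier curve defined by control points `ctrl` at t."""
    n = len(ctrl) - 1
    x = sum(bernstein(i, n, t) * p[0] for i, p in enumerate(ctrl))
    y = sum(bernstein(i, n, t) * p[1] for i, p in enumerate(ctrl))
    return (x, y)


def solve_linear(a, b):
    """Solve the square system a @ x = b by Gaussian elimination."""
    n = len(a)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: move the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def fit_bezier(points, degree=3):
    """Least-squares control points for an ordered list of (x, y) points.

    Assumption: point k gets parameter t = k / (len(points) - 1), i.e.
    the points are treated as uniformly spaced along the curve.
    Requires at least degree + 1 points.
    """
    count = len(points)
    ts = [k / (count - 1) for k in range(count)]
    basis = [[bernstein(i, degree, t) for i in range(degree + 1)] for t in ts]
    size = degree + 1
    # Normal equations (B^T B) c = B^T p, solved once per coordinate.
    btb = [[sum(row[i] * row[j] for row in basis) for j in range(size)]
           for i in range(size)]
    coords = []
    for axis in (0, 1):
        btp = [sum(row[i] * p[axis] for row, p in zip(basis, points))
               for i in range(size)]
        coords.append(solve_linear(btb, btp))
    return list(zip(coords[0], coords[1]))


# Usage: sample a known cubic curve, then recover its control points.
true_ctrl = [(0.0, 0.0), (1.0, 2.0), (3.0, 2.0), (4.0, 0.0)]
samples = [bezier_point(true_ctrl, k / 9) for k in range(10)]
print(fit_bezier(samples))
```

Under these assumptions, one long side of a curved text region is summarized by four control points instead of a dense polygon. A detector that regresses a fixed, small number of control points adds little work on top of box regression, which is the intuition behind the abstract's claim of negligible overhead.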

Related benchmarks

Task                     | Dataset           | Result                | Rank
Text Detection           | ICDAR 2015        | Precision 90.4        | 171
Text Detection           | CTW1500 (test)    | Precision 84.4        | 157
Scene Text Detection     | ICDAR 2015 (test) | F1 Score 88.1         | 150
Text Detection           | Total-Text        | Recall 81.3           | 139
Text Detection           | Total-Text (test) | F-Measure 87          | 126
Scene Text Detection     | TotalText (test)  | Recall 84.1           | 106
Scene Text Spotting      | Total-Text (test) | F-measure (None) 73.5 | 105
End-to-End Text Spotting | ICDAR 2015        | Strong Score 82.7     | 80
Text Detection           | CTW1500           | F-measure 84.7        | 70
Scene Text Detection     | Total-Text        | Precision 87.9        | 63
Showing 10 of 27 rows

