E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
About
An end-to-end trainable (fully differentiable) method for multi-language scene text localization and recognition is proposed. The approach is based on a single fully convolutional network (FCN) with shared layers for both tasks. E2E-MLT is the first published multi-language OCR for scene text. While trained in multi-language setup, E2E-MLT demonstrates competitive performance when compared to other methods trained for English scene text alone. The experiments show that obtaining accurate multi-language multi-script annotations is a challenging problem.
Michal Bu\v{s}ta, Yash Patel, Jiri Matas• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| End-to-end multilingual recognition | MLT 2019 (test) | F-measure (%)26.5 | 15 | |
| Joint Text Detection and Script Identification | MLT 2017 (test) | F1 Score58.7 | 8 |
Showing 2 of 2 rows