Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Text Prior Guided Scene Text Image Super-resolution

About

Scene text image super-resolution (STISR) aims to improve the resolution and visual quality of low-resolution (LR) scene text images, and consequently boost the performance of text recognition. However, most of existing STISR methods regard text images as natural scene images, ignoring the categorical information of text. In this paper, we make an inspiring attempt to embed categorical text prior into STISR model training. Specifically, we adopt the character probability sequence as the text prior, which can be obtained conveniently from a text recognition model. The text prior provides categorical guidance to recover high-resolution (HR) text images. On the other hand, the reconstructed HR image can refine the text prior in return. Finally, we present a multi-stage text prior guided super-resolution (TPGSR) framework for STISR. Our experiments on the benchmark TextZoom dataset show that TPGSR can not only effectively improve the visual quality of scene text images, but also significantly improve the text recognition accuracy over existing STISR methods. Our model trained on TextZoom also demonstrates certain generalization capability to the LR images in other datasets.

Jianqi Ma, Shi Guo, Lei Zhang• 2021

Related benchmarks

TaskDatasetResultRank
Scene Text Image Super-ResolutionTextZoom (medium)
PSNR19.17
20
Scene Text Image Super-ResolutionTextZoom easy
PSNR24.35
20
Scene Text Image Super-ResolutionTextZoom hard
PSNR20.06
20
Scene Text RecognitionTextZoom easy
ASTER78.9
11
Scene Text RecognitionTextZoom (medium)
ASTER Sequence Accuracy62.7
11
Scene Text RecognitionTextZoom hard
ASTER Score44.5
11
Scene Text RecognitionTextZoom (avg)
ASTER Score62.8
11
Scene Text Image Super-ResolutionTextZoom (average)
PSNR21.18
10
Showing 8 of 8 rows

Other info

Follow for update