Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Editing Text in the Wild

About

In this paper, we are interested in editing text in natural images, which aims to replace or modify a word in the source image with another one while maintaining its realistic look. This task is challenging, as the styles of both background and text need to be preserved so that the edited image is visually indistinguishable from the source image. Specifically, we propose an end-to-end trainable style retention network (SRNet) that consists of three modules: text conversion module, background inpainting module and fusion module. The text conversion module changes the text content of the source image into the target text while keeping the original text style. The background inpainting module erases the original text, and fills the text region with appropriate texture. The fusion module combines the information from the two former modules, and generates the edited text images. To our knowledge, this work is the first attempt to edit text in natural images at the word level. Both visual effects and quantitative results on synthetic and real-world dataset (ICDAR 2013) fully confirm the importance and necessity of modular decomposition. We also conduct extensive experiments to validate the usefulness of our method in various real-world applications such as text image synthesis, augmented reality (AR) translation, information hiding, etc.

Liang Wu, Chengquan Zhang, Jiaming Liu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai• 2019

Related benchmarks

TaskDatasetResultRank
Text Style Fidelity AssessmentScenePair Full-size Image
SSIM98.91
9
Scene Text EditingEnglish Scene Text Editing Dataset (test)
Sen.Acc39.94
8
Scene Text EditingEnglish ScenePair (test)
W.Acc16.64
7
Text rendering accuracyScenePair 1.0 (test)
Accuracy (%)17.84
6
Text rendering accuracyScenePair Random 1.0 (test)
Accuracy9.61
6
Text Style Fidelity AssessmentScenePair Cropped Text Image
SSIM26.66
6
Scene Text ErasureICDAR 2013 (test)
F1 Score0.0464
5
Text rendering accuracyTamperScene 1.0 (test)
Accuracy39.96
3
Showing 8 of 8 rows

Other info

Follow for update