Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control

About

In image editing tasks, high-quality text editing capabilities can significantly reduce both human and material resource costs. Existing methods, however, face significant limitations in terms of stroke accuracy for complex text and controllability of generated text styles. To address these challenges, we propose TextMaster, a solution capable of accurately editing text across various scenarios and image regions, while ensuring proper layout and controllable text style. Our method enhances the accuracy and fidelity of text rendering by incorporating high-resolution standard glyph information and applying perceptual loss within the text editing region. Additionally, we leverage an attention mechanism to compute intermediate layer bounding box regression loss for each character, enabling the model to learn text layout across varying contexts. Furthermore, we propose a novel style injection technique that enables controllable style transfer for the injected text. Through comprehensive experiments, we demonstrate the state-of-the-art performance of our method.

Zhenyu Yan, Jian Wang, Aoqiang Wang, Yuhan Li, Wenxiang Shang, Ran Lin• 2024

Related benchmarks

TaskDatasetResultRank
Text editingAnyText benchmark Chinese 1.0 (test)
Sen.ACC0.9257
8
Random text editingAnyText English v1
Accuracy93.58
7
Random text editingICDAR English 13
Accuracy (Acc)91
7
Random text editingTextMaster English
Accuracy91.8
7
Random text editingTextMaster Chinese
Accuracy91.8
3
Showing 5 of 5 rows

Other info

Follow for update