Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SELECT: Detecting Label Errors in Real-world Scene Text Data

About

We introduce SELECT (Scene tExt Label Errors deteCTion), a novel approach that leverages multi-modal training to detect label errors in real-world scene text datasets. Utilizing an image-text encoder and a character-level tokenizer, SELECT addresses the issues of variable-length sequence labels, label sequence misalignment, and character-level errors, outperforming existing methods in accuracy and practical utility. In addition, we introduce Similarity-based Sequence Label Corruption (SSLC), a process that intentionally introduces errors into the training labels to mimic real-world error scenarios during training. SSLC not only can cause a change in the sequence length but also takes into account the visual similarity between characters during corruption. Our method is the first to detect label errors in real-world scene text datasets successfully accounting for variable-length labels. Experimental results demonstrate the effectiveness of SELECT in detecting label errors and improving STR accuracy on real-world text datasets, showcasing its practical utility.

Wenjun Liu, Qian Wu, Yifeng Hu, Yuke Li• 2025

Related benchmarks

TaskDatasetResultRank
Label Error Detectioncorrupted MJSynth (test)
Precision98.33
3
Showing 1 of 1 rows

Other info

Follow for update