Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DeRAGEC: Denoising Named Entity Candidates with Synthetic Rationale for ASR Error Correction

About

We present DeRAGEC, a method for improving Named Entity (NE) correction in Automatic Speech Recognition (ASR) systems. By extending the Retrieval-Augmented Generative Error Correction (RAGEC) framework, DeRAGEC employs synthetic denoising rationales to filter out noisy NE candidates before correction. By leveraging phonetic similarity and augmented definitions, it refines noisy retrieved NEs using in-context learning, requiring no additional training. Experimental results on CommonVoice and STOP datasets show significant improvements in Word Error Rate (WER) and NE hit ratio, outperforming baseline ASR and RAGEC methods. Specifically, we achieved a 28% relative reduction in WER compared to ASR without postprocessing. Our source code is publicly available at: https://github.com/solee0022/deragec

Solee Im, Wonjun Lee, Jinmyeong An, Yunsu Kim, Jungseul Ok, Gary Geunbae Lee• 2025

Related benchmarks

TaskDatasetResultRank
ASR Error CorrectionCommonVoice (CV) (test)
WER6
18
ASR Error CorrectionSTOP (test)
WER5.8
18
Showing 2 of 2 rows

Other info

Code

Follow for update