Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition

About

The performance of automatic speech recognition models often degenerates on domains not covered by the training data. Domain adaptation can address this issue, assuming the availability of the target domain data in the target language. However, such assumption does not stand in many real-world applications. To make domain adaptation more applicable, we address the problem of zero-shot domain adaptation (ZSDA), where target domain data is unavailable in the target language. Instead, we transfer the target domain knowledge from another source language where the target domain data is more accessible. To do that, we first perform cross-lingual pre-training (XLPT) to share domain knowledge across languages, then use target language fine-tuning to build the final model. One challenge in this practice is that the pre-trained knowledge can be forgotten during fine-tuning, resulting in sub-optimal adaptation performance. To address this issue, we propose transliterated ZSDA to achieve consistent pre-training and fine-tuning labels, leading to maximum preservation of the pre-trained knowledge. Experimental results show that transliterated ZSDA relatively decreases the word error rate by 9.2% compared with a wav2vec 2.0 baseline. Moreover, transliterated ZSDA consistently outperforms self-supervised ZSDA and performs on par with supervised ZSDA, proving the superiority of transliteration-based pre-training labels.

Han Zhu, Gaofeng Cheng, Qingwei Zhao, Pengyuan Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Script confusion mitigationFLEURS zh-trad (test)
Accuracy91
21
Script confusion mitigationFLEURS zh-sim (test)
Normalized Edit Similarity Accuracy93
21
Script confusion mitigationFLEURS sr-latn (test)
Accuracy (Normalized Edit Similarity)95
21
Script confusion mitigationFLEURS sr-cyrl (test)
Normalized Edit Similarity0.91
21
CyrillizationFLEURS Hindi (test)
Accuracy1
4
RomanizationFLEURS Hindi (test)
Accuracy2
4
CyrillizationFLEURS Greek (test)
Accuracy1
4
CyrillizationFLEURS Japanese (test)
Accuracy1
4
CyrillizationFLEURS Korean (test)
Accuracy1
4
CyrillizationFLEURS Italian (test)
Accuracy1
4
Showing 10 of 14 rows

Other info

Follow for update