Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models
About
Rare words remain a critical bottleneck for speech-to-text systems. While direct fine-tuning improves recognition of target words, it often incurs high cost, catastrophic forgetting, and limited scalability. To address these challenges, we propose a training-free paradigm based on task vectors for rare word recognition and translation. By defining task vectors as parameter differences and introducing word-level task vector arithmetic, our approach enables flexible composition of rare-word capabilities, greatly enhancing scalability and reusability. Extensive experiments across multiple domains show that the proposed method matches or surpasses fine-tuned models on target words, improves general performance by about 5 BLEU, and mitigates catastrophic forgetting.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Chinese-to-English speech translation | W1 rare word dataset | BLEU66.77 | 6 | |
| Chinese-to-English speech translation | General-domain (test) | BLEU40.77 | 6 | |
| Chinese-to-English speech translation | W2 rare word | BLEU74.88 | 6 | |
| Chinese-to-English speech translation | W3 rare word | BLEU68.66 | 6 | |
| Chinese-to-English speech translation | W4 rare word | BLEU73.17 | 6 | |
| Chinese-to-English speech translation | W5 rare word dataset | BLEU60.01 | 6 | |
| Chinese-to-English speech translation | W7 rare word dataset | BLEU61.02 | 6 | |
| Chinese-to-English speech translation | W8 rare word | BLEU Score62.86 | 6 | |
| Chinese-to-English speech translation | W10 rare word | BLEU Score64.96 | 6 | |
| Chinese-to-English speech translation | W6 rare word | BLEU70.66 | 6 |