Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models

About

Rare words remain a critical bottleneck for speech-to-text systems. While direct fine-tuning improves recognition of target words, it often incurs high cost, catastrophic forgetting, and limited scalability. To address these challenges, we propose a training-free paradigm based on task vectors for rare word recognition and translation. By defining task vectors as parameter differences and introducing word-level task vector arithmetic, our approach enables flexible composition of rare-word capabilities, greatly enhancing scalability and reusability. Extensive experiments across multiple domains show that the proposed method matches or surpasses fine-tuned models on target words, improves general performance by about 5 BLEU, and mitigates catastrophic forgetting.

Ruihao Jing, Cheng Gong, Yu Jiang, Boyu Zhu, Shansong Liu, Chi Zhang, Xiao-Lei Zhang, Xuelong Li• 2025

Related benchmarks

Task	Dataset	Result
Chinese-to-English speech translation	W1 rare word dataset	BLEU66.77	6
Chinese-to-English speech translation	General-domain (test)	BLEU40.77	6
Chinese-to-English speech translation	W2 rare word	BLEU74.88	6
Chinese-to-English speech translation	W3 rare word	BLEU68.66	6
Chinese-to-English speech translation	W4 rare word	BLEU73.17	6
Chinese-to-English speech translation	W5 rare word dataset	BLEU60.01	6
Chinese-to-English speech translation	W7 rare word dataset	BLEU61.02	6
Chinese-to-English speech translation	W8 rare word	BLEU Score62.86	6
Chinese-to-English speech translation	W10 rare word	BLEU Score64.96	6
Chinese-to-English speech translation	W6 rare word	BLEU70.66	6

Showing 10 of 21 rows

Other info

Follow for update

@wizwand_team Discord