Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models

About

Rare words remain a critical bottleneck for speech-to-text systems. While direct fine-tuning improves recognition of target words, it often incurs high cost, catastrophic forgetting, and limited scalability. To address these challenges, we propose a training-free paradigm based on task vectors for rare word recognition and translation. By defining task vectors as parameter differences and introducing word-level task vector arithmetic, our approach enables flexible composition of rare-word capabilities, greatly enhancing scalability and reusability. Extensive experiments across multiple domains show that the proposed method matches or surpasses fine-tuned models on target words, improves general performance by about 5 BLEU, and mitigates catastrophic forgetting.

Ruihao Jing, Cheng Gong, Yu Jiang, Boyu Zhu, Shansong Liu, Chi Zhang, Xiao-Lei Zhang, Xuelong Li• 2025

Related benchmarks

TaskDatasetResultRank
Chinese-to-English speech translationW1 rare word dataset
BLEU66.77
6
Chinese-to-English speech translationGeneral-domain (test)
BLEU40.77
6
Chinese-to-English speech translationW2 rare word
BLEU74.88
6
Chinese-to-English speech translationW3 rare word
BLEU68.66
6
Chinese-to-English speech translationW4 rare word
BLEU73.17
6
Chinese-to-English speech translationW5 rare word dataset
BLEU60.01
6
Chinese-to-English speech translationW7 rare word dataset
BLEU61.02
6
Chinese-to-English speech translationW8 rare word
BLEU Score62.86
6
Chinese-to-English speech translationW10 rare word
BLEU Score64.96
6
Chinese-to-English speech translationW6 rare word
BLEU70.66
6
Showing 10 of 21 rows

Other info

Follow for update