Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Open-vocabulary Keyword-spotting with Adaptive Instance Normalization

About

Open vocabulary keyword spotting is a crucial and challenging task in automatic speech recognition (ASR) that focuses on detecting user-defined keywords within a spoken utterance. Keyword spotting methods commonly map the audio utterance and keyword into a joint embedding space to obtain some affinity score. In this work, we propose AdaKWS, a novel method for keyword spotting in which a text encoder is trained to output keyword-conditioned normalization parameters. These parameters are used to process the auditory input. We provide an extensive evaluation using challenging and diverse multi-lingual benchmarks and show significant improvements over recent keyword spotting and ASR baselines. Furthermore, we study the effectiveness of our approach on low-resource languages that were unseen during the training. The results demonstrate a substantial performance improvement compared to baseline methods.

Aviv Navon, Aviv Shamsian, Neta Glazer, Gill Hetz, Joseph Keshet• 2023

Related benchmarks

TaskDatasetResultRank
Keyword SpottingLibriPhrase Easy (LPE)
EER1.21
46
Speaker-Independent Keyword SpottingLibriPhrase hard
AUROC95.09
21
Keyword SpottingLibriPhrase Hard (LPH)
EER0.1347
20
Showing 3 of 3 rows

Other info

Follow for update