
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression

About

Noise suppression (NS) models are widely used to enhance speech quality. Recently, deep learning-based NS, which we denote as Deep Noise Suppression (DNS), has become the mainstream NS method because it outperforms traditional approaches. However, DNS models face two major challenges in real-world applications. First, high-performing DNS models are usually large, which makes them difficult to deploy. Second, DNS models require extensive training data, with noisy audio as input and clean audio as labels, and clean labels are often difficult to obtain. We propose using knowledge distillation (KD) to address both challenges. Our study serves two main purposes. First, we are among the first to comprehensively investigate mainstream KD techniques on DNS models as a way to address these two challenges. Second, we propose a novel Attention-Based-Compression KD method that outperforms all investigated mainstream KD frameworks on the DNS task.

Yixin Wan, Yuan Zhou, Xiulian Peng, Kai-Wei Chang, Yan Lu • 2023

Related benchmarks

Task | Dataset | Result | Rank
Speech Enhancement | DNS no_reverb (test) | PESQ 2.9 | 46
Speech Enhancement | L3DAS23 (dev) | WER 16.9 | 17
