ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression

About

Noise suppression (NS) models are widely used to enhance speech quality. Recently, deep learning-based NS, which we denote as Deep Noise Suppression (DNS), has become the mainstream NS method because it outperforms traditional approaches. However, DNS models face two major challenges in real-world applications. First, high-performing DNS models are usually large, which makes deployment difficult. Second, DNS models require extensive training data, consisting of noisy audio as input and clean audio as labels, and clean labels are often difficult to obtain. We propose using knowledge distillation (KD) to address both challenges. Our study serves two main purposes. First, we are among the first to comprehensively investigate mainstream KD techniques on DNS models as a way to resolve these two challenges. Second, we propose a novel Attention-Based-Compression KD method that outperforms all investigated mainstream KD frameworks on the DNS task.

Yixin Wan, Yuan Zhou, Xiulian Peng, Kai-Wei Chang, Yan Lu • 2023

Related benchmarks

Task	Dataset	Result	Rank
Speech Enhancement	DNS no_reverb (test)	PESQ 2.9	46
Speech Enhancement	L3DAS23 (dev)	WER 16.9	17
