TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

About

This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge. We expand our previous work, TEA-PSE, to its upgraded version -- TEA-PSE 3.0. Specifically, TEA-PSE 3.0 incorporates a residual LSTM after squeezed temporal convolution network (S-TCN) to enhance sequence modeling capabilities. Additionally, the local-global representation (LGR) structure is introduced to boost speaker information extraction, and multi-STFT resolution loss is used to effectively capture the time-frequency characteristics of the speech signals. Moreover, retraining methods are employed based on the freeze training strategy to fine-tune the system. According to the official results, TEA-PSE 3.0 ranks 1st in both ICASSP 2023 DNS-Challenge track 1 and track 2.

Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang• 2023

Related benchmarks

Task	Dataset	Result
Personalized Speech Enhancement	DNS Track 2: Speakerphone Blind 5 (test)	SIG Score3.99	19
Personalized Speech Enhancement	DNS Track 1: Headset 5 (test)	SIG Score4.12	19
Target Speaker Extraction	ICASSP DNS-challenge Track 1 - Headset 2023 (test)	SIG Score4.12	5
Target Speaker Extraction	ICASSP DNS-challenge Track 2 - Speakerphone 2023 (test)	SIG Score3.99	5

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord