Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SEGAN: Speech Enhancement Generative Adversarial Network

About

Current speech enhancement techniques operate on the spectral domain and/or exploit some higher-level feature. The majority of them tackle a limited number of noise conditions and rely on first-order statistics. To circumvent these issues, deep networks are being increasingly used, thanks to their ability to learn complex functions from large example sets. In this work, we propose the use of generative adversarial networks for speech enhancement. In contrast to current techniques, we operate at the waveform level, training the model end-to-end, and incorporate 28 speakers and 40 different noise conditions into the same model, such that model parameters are shared across them. We evaluate the proposed model using an independent, unseen test set with two speakers and 20 alternative noise conditions. The enhanced samples confirm the viability of the proposed model, and both objective and subjective evaluations confirm the effectiveness of it. With that, we open the exploration of generative architectures for speech enhancement, which may progressively incorporate further speech-centric design choices to improve their performance.

Santiago Pascual, Antonio Bonafonte, Joan Serr\`a• 2017

Related benchmarks

TaskDatasetResultRank
Speech EnhancementVoiceBank + DEMAND (VB-DMD) (test)
PESQ2.16
105
Speech EnhancementVoiceBank-DEMAND (test)
PESQ2.16
96
Automatic Speech RecognitionATC Corpus
CER (DS2)9.74
27
Speech EnhancementATC Corpus
CSIG3.56
19
Audio GenerationMusic Noise SNR=-10 (test)
Generation Success Rate82.67
18
Audio GenerationSound Noise SNR=-10 (test)
Success Rate73.67
18
Audio GenerationSpeech Noise SNR=-10 (test)
Success Rate69
18
Speech EnhancementLRS3 mixed with VGGSound noises (test)
PESQ2.65
18
Speech EnhancementLRS3 mixed with QUT city-street noises (test)
PESQ2.61
18
Audio GenerationSound Clean (test)
Generation Success Rate87
18
Showing 10 of 29 rows

Other info

Follow for update