CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram

About

In this work, we present CleanUNet 2, a speech denoising model that combines the advantages of waveform and spectrogram denoisers. CleanUNet 2 uses a two-stage framework inspired by popular speech synthesis methods, which pair a waveform model with a spectrogram model. Specifically, CleanUNet 2 builds upon CleanUNet, the state-of-the-art waveform denoiser, and further boosts its performance by taking the spectrograms predicted by a spectrogram denoiser as an additional input. We demonstrate that CleanUNet 2 outperforms previous methods on a range of objective and subjective evaluations.
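
The two-stage data flow described in the abstract can be sketched roughly as follows. This is only an illustration of how the stages connect, not the authors' implementation: both denoisers are trivial hand-written stand-ins (a fixed noise-floor subtraction and a per-frame gain) in place of the neural networks, and the names `two_stage_denoise`, `spectrogram_denoiser`, `waveform_denoiser`, `FRAME`, and `HOP` are hypothetical.

```python
import cmath
import math
import random

FRAME = 16  # hypothetical frame length in samples
HOP = 8     # hypothetical hop size in samples


def magnitude_spectrogram(wave):
    """Magnitudes of a naive DFT over rectangular-windowed frames."""
    if len(wave) < FRAME:
        raise ValueError("input must contain at least FRAME samples")
    frames = [wave[i:i + FRAME] for i in range(0, len(wave) - FRAME + 1, HOP)]
    bins = FRAME // 2 + 1
    return [
        [abs(sum(x * cmath.exp(-2j * math.pi * k * n / FRAME)
                 for n, x in enumerate(frame)))
         for k in range(bins)]
        for frame in frames
    ]


def spectrogram_denoiser(noisy_spec, floor=0.5):
    """Stand-in for stage 1: subtract an assumed fixed noise floor per bin."""
    return [[max(m - floor, 0.0) for m in frame] for frame in noisy_spec]


def waveform_denoiser(noisy_wave, predicted_spec):
    """Stand-in for stage 2: consumes the noisy waveform AND the predicted
    spectrogram, here by scaling each hop with a spectrogram-derived gain."""
    noisy_spec = magnitude_spectrogram(noisy_wave)
    gains = [sum(p) / (sum(n) + 1e-8)
             for p, n in zip(predicted_spec, noisy_spec)]
    last = len(gains) - 1
    return [x * gains[min(i // HOP, last)] for i, x in enumerate(noisy_wave)]


def two_stage_denoise(noisy_wave):
    """Stage 1 predicts a cleaner spectrogram; stage 2 conditions on it."""
    noisy_spec = magnitude_spectrogram(noisy_wave)
    predicted_spec = spectrogram_denoiser(noisy_spec)
    return waveform_denoiser(noisy_wave, predicted_spec)


# Toy input: a sine wave with additive Gaussian noise (synthetic data).
random.seed(0)
clean = [math.sin(2 * math.pi * 4 * n / 64) for n in range(64)]
noisy = [c + random.gauss(0, 0.1) for c in clean]
denoised = two_stage_denoise(noisy)
print(len(denoised))
```

The point of the sketch is the interface: the second stage receives both the raw noisy waveform and the first stage's spectrogram prediction, rather than operating on either representation alone.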

Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro • 2023

Related benchmarks

Task              Dataset               Result            Rank
Speech Denoising  DNS no-reverb (test)  PESQ (WB) 3.262   16
Speech Denoising  DNS                   RTF 9.91e-4       4

