Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement

About

Generative adversarial networks have recently demonstrated outstanding performance in neural vocoding outperforming best autoregressive and flow-based models. In this paper, we show that this success can be extended to other tasks of conditional audio generation. In particular, building upon HiFi vocoders, we propose a novel HiFi++ general framework for bandwidth extension and speech enhancement. We show that with the improved generator architecture, HiFi++ performs better or comparably with the state-of-the-art in these tasks while spending significantly less computational resources. The effectiveness of our approach is validated through a series of extensive experiments.

Pavel Andreev, Aibek Alanov, Oleg Ivanov, Dmitry Vetrov• 2022

Related benchmarks

TaskDatasetResultRank
Bandwidth extensionVCTK-BWE BW=2K (test)
WVMOS3.95
7
Bandwidth extensionVCTK-BWE BW=4K (test)
WVMOS4.16
7
Audio compressionVoxCeleb (test)
MEL Score4.35
6
Bandwidth extensionVCTK-BWE BW=1K (test)
WVMOS3.71
6
Bandwidth extensionVCTK
CSIG3.51
4
Showing 5 of 5 rows

Other info

Follow for update