Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Deep Filter Estimation from Inter-Frame Correlations for Monaural Speech Dereverberation

About

Speech dereverberation in distant-microphone scenarios remains challenging due to the high correlation between reverberation and target signals, often leading to poor generalization in real-world environments. We propose IF-CorrNet, a correlation-to-filter architecture designed for robustness against acoustic variability. Unlike conventional black-box mapping methods that directly estimate complex spectra, IF-CorrNet explicitly exploits inter-frame STFT correlations to estimate multi-frame deep filters for each time-frequency bin. By shifting the learning objective from direct mapping to filter estimation, the network effectively constrains the solution space, which simplifies the training process and mitigates overfitting to synthetic data. Experimental results on the REVERB Challenge dataset demonstrate that IF-CorrNet achieves a substantial gain in the SRMR metric on RealData, confirming its robustness in suppressing reverberation and noise in practical, non-synthetic environments.

Ui-Hyeop Shin, Jun Hyung Kim, Jangyeon Kim, Wooseok Kim, Hyung-Min Park• 2026

Related benchmarks

TaskDatasetResultRank
Speech DereverberationREVERB Challenge SimData (test)
CD1.899
16
Speech DereverberationREVERB Challenge Evaluation Set (RealData)
SRMR7.548
16
Showing 2 of 2 rows

Other info

Follow for update