Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Schr\"odinger Bridge Mamba for One-Step Speech Enhancement

About

We present Schr\"odinger Bridge Mamba (SBM), a novel model for efficient speech enhancement by integrating the Schr\"odinger Bridge (SB) training paradigm and the Mamba architecture. Experiments of joint denoising and dereverberation tasks demonstrate SBM outperforms strong generative and discriminative methods on multiple metrics with only one step of inference while achieving a competitive real-time factor for streaming feasibility. Ablation studies reveal that the SB paradigm consistently yields improved performance across diverse architectures over conventional mapping. Furthermore, Mamba exhibits a stronger performance under the SB paradigm compared to Multi-Head Self-Attention (MHSA) and Long Short-Term Memory (LSTM) backbones. These findings highlight the synergy between the Mamba architecture and the SB trajectory-based training, providing a high-quality solution for real-world speech enhancement. Demo page: https://sbmse.github.io

Jing Yang, Sirui Wang, Chao Wu, Lei Guo, Fan Fan• 2025

Related benchmarks

TaskDatasetResultRank
Speech EnhancementVoiceBank-DEMAND (test)
PESQ3.503
128
Speech EnhancementDNS Challenge Real Recordings (test)
SIG Score3.459
32
Speech EnhancementDNS no_reverb (test)
PESQ2.825
27
Speech EnhancementDNS with reverb (test)
STOI71
27
Showing 4 of 4 rows

Other info

Follow for update