Schr\"odinger Bridge Mamba for One-Step Speech Enhancement

About

We present Schr\"odinger Bridge Mamba (SBM), a novel model for efficient speech enhancement by integrating the Schr\"odinger Bridge (SB) training paradigm and the Mamba architecture. Experiments of joint denoising and dereverberation tasks demonstrate SBM outperforms strong generative and discriminative methods on multiple metrics with only one step of inference while achieving a competitive real-time factor for streaming feasibility. Ablation studies reveal that the SB paradigm consistently yields improved performance across diverse architectures over conventional mapping. Furthermore, Mamba exhibits a stronger performance under the SB paradigm compared to Multi-Head Self-Attention (MHSA) and Long Short-Term Memory (LSTM) backbones. These findings highlight the synergy between the Mamba architecture and the SB trajectory-based training, providing a high-quality solution for real-world speech enhancement. Demo page: https://sbmse.github.io

Jing Yang, Sirui Wang, Chao Wu, Lei Guo, Fan Fan• 2025

Related benchmarks

Task	Dataset	Result
Speech Enhancement	VoiceBank-DEMAND (test)	PESQ3.503	201
Speech Enhancement	DNS no_reverb (test)	PESQ2.825	46
Speech Enhancement	DNS Challenge Real Recordings (test)	SIG Score3.459	41
Speech Enhancement	DNS with reverb (test)	STOI71	27

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord