Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

End-to-end music source separation: is it possible in the waveform domain?

About

Most of the currently successful source separation techniques use the magnitude spectrogram as input, and are therefore by default omitting part of the signal: the phase. To avoid omitting potentially useful information, we study the viability of using end-to-end models for music source separation --- which take into account all the information available in the raw audio signal, including the phase. Although during the last decades end-to-end music source separation has been considered almost unattainable, our results confirm that waveform-based models can perform similarly (if not better) than a spectrogram-based deep learning model. Namely: a Wavenet-based model we propose and Wave-U-Net can outperform DeepConvSep, a recent spectrogram-based deep learning model.

Francesc Llu\'is, Jordi Pons, Xavier Serra• 2018

Related benchmarks

TaskDatasetResultRank
Music Source SeparationMUSDB18 (test)
SDR (Bass)3.21
69
Singing Voice Source SeparationMUSDB (test)
Vocals SDR4.49
11
Multi-instrument source separationMUSDB18 SiSEC (test)
Vocals SDR3.46
8
Showing 3 of 3 rows

Other info

Code

Follow for update