
EARS

Benchmarks

Task Name                     Dataset Name              SOTA Result   Trend
VDM reconstruction            EARS RT60 = 0.6 s (test)  SDR 19.7      13
VDM reconstruction            EARS RT60 = 0.4 s (test)  SDR 20.37     13
VDM reconstruction            EARS RT60 = 0.2 s (test)  SDR 22.12     13
Neural Vocoding               EARS (out-of-domain)      UTMOS 3.3     9
Zero-shot Text-to-Speech      EARS (unseen speakers)    WER 1.65      7
Speech Synthesis              EARS                      PESQ 4.238    6
Automatic Speech Recognition  EARS-Reverb               WER (%) 3.83  6
Diffuse sound extraction      EARS RT60 = 0.6 s (test)  SDR 8.22      5
Diffuse sound extraction      EARS RT60 = 0.4 s (test)  SDR 7.26      5
Diffuse sound extraction      EARS RT60 = 0.2 s (test)  SDR 3.99      5