LJSpeech

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Audio Watermarking | LJSpeech | PESQ 4.5486 | 88 |
| Audio Reconstruction | LJSpeech | UTMOS 4.3794 | 26 |
| Text-to-Speech | LJSpeech (test) | CMOS 0.934 | 20 |
| Speech Watermarking | LJSpeech 2017 | STOI 0.9996 | 17 |
| Speech Watermarking | LJSpeech (In-Distribution) | MP3 (16 kbps) Acc 0.9984 | 13 |
| Speech Watermarking | LJSpeech (in-distribution) | Gaussian Noise (5 dB) Score 0.9986 | 13 |
| Waveform Generation | LJSpeech | UTMOS 4.3894 | 12 |
| Speech Synthesis | LJSpeech | MOS 4.45 | 12 |
| Audio Synthesis | LJSpeech (unseen) | MAE 0.1102 | 10 |
| Neural Vocoding | LJSpeech | MOS 4.49 | 9 |
| Waveform Generation | LJSpeech (test) | M-STFT 0.9369 | 8 |
| Generative Speech Watermarking | LJSpeech (test) | Inference Time (ms) 13.48 | 7 |
| Voice Conversion | LJSpeech target speaker | WER 3.22 | 7 |
| Speech reconstruction | LJSpeech ID | MCD 4.42 | 6 |
| Audio Synthesis | LJSpeech (test) | GPU Execution Time 4.82 | 6 |
| Speech Synthesis | LJSpeech (test) | RTF 0.011 | 6 |
| Waveform Synthesis | LJSpeech | Training Time (h) 17.02 | 4 |
| Speech Synthesis | LJSpeech 26 (test) | PESQ 3.807 | 3 |
| Text-to-Speech | LJSpeech | MAE 0.131 | 3 |
| Text-to-Speech | LJSpeech low-resource setting | Intelligibility Rate 97 | 3 |
| Audio Synthesis | LJSpeech 44.1kHz (test) | GPU xRT 152.58 | 2 |
| Speech Synthesis | LJSpeech | CMOS-N Score 1.07 | 2 |
| Mel-spectrogram generation | LJSpeech (test) | Speedup 269.4 | 1 |