Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LJSpeech

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio WatermarkingLJSpeech
PESQ4.5486
88
Audio ReconstructionLJSpeech
UTMOS4.3794
26
Text-to-SpeechLJSpeech (test)
CMOS0.934
20
Speech WatermarkingLJSpeech 2017
STOI0.9996
17
Speech WatermarkingLJSpeech (In-Distribution)
MP3 (16 kbps) Acc0.9984
13
Speech WatermarkingLJSpeech (in-distribution)
Gaussian Noise (5 dB) Score0.9986
13
Neural VocodingLJSpeech 1.1 (test)
M-STFT0.9
12
Neural VocodingLJSpeech 88 (test)
M-STFT0.9
12
Waveform GenerationLJSpeech
UTMOS4.3894
12
Speech SynthesisLJSpeech
MOS4.45
12
Lossless Data CompressionLJSpeech
Compression Ratio1.88
11
Audio SynthesisLJSpeech (unseen)
MAE0.1102
10
Audio GenerationLJSpeech Short-Term (test)
FAD0.911
9
Neural VocodingLJSpeech
MOS4.49
9
Neural VocodingLJSpeech (Long Audio)
MOS4.73
8
Neural VocodingLJSpeech Short Audio
MOS3.67
8
Waveform GenerationLJSpeech (test)
M-STFT0.9369
8
Generative Speech WatermarkingLJSpeech (test)
Inference Time (ms)13.48
7
Voice ConversionLJSpeech target speaker
WER3.22
7
Text-to-SpeechLJSpeech
WER3.37
6
Speech reconstructionLJSpeech ID
MCD4.42
6
Audio SynthesisLJSpeech (test)
GPU Execution Time4.82
6
Speech SynthesisLJSpeech (test)
RTF0.011
6
Lossless Audio CompressionLJSpeech 16-bit
Compression Rate2.08
5
Speech SynthesisLJSpeech
PESQ4.235
5
Showing 25 of 38 rows