Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LibriTTS

Benchmarks

Task NameDataset NameSOTA ResultTrend
Speech ReconstructionLibriTTS clean (test)
PESQ4.644
63
Speech ReconstructionLibriTTS (test-other)
UTMOS3.91
44
Text-to-SpeechLibriTTS clean (test)
WER0.018
30
Speech SynthesisLibriTTS (ID)
PESQ4.5
20
Neural VocodingLibriTTS (test)
PESQ4.269
18
Audio GenerationLibriTTS (dev)
M-STFT1.3647
18
Speech SynthesisLibriTTS (test)
MOS4.9134
17
Text-to-SpeechLibriTTS (test)
MOS4.54
16
Text-to-SpeechLibriTTS zero-shot
UTMOS4.3026
14
Waveform GenerationLibriTTS 24,000 Hz (test)
UTMOS3.7229
13
Zero-shot Text-to-SpeechLibriTTS (test)
SECS0.765
12
Waveform GenerationLibriTTS (dev)
M-STFT1.2129
12
Neural VocodingLibriTTS
UTMOS4.058
12
Speech SynthesisLibriTTS (dev)
M-STFT1.086
11
Voice ConversionLibriTTS (test-clean)
WER2.04
11
Speech SynthesisLibriTTS 24,000 Hz (test)
MOS4.28
11
Waveform GenerationLibriTTS-R clean (test)
Speech BERT Score100
10
Audio ReconstructionLibriTTS clean (test)
Mel Distance0.3442
10
VocodingLibriTTS (dev-other)
MAE0.0986
10
Neural VocodingLibriTTS clean (dev)
MAE0.0931
10
Audio WatermarkingLibriTTS
PESQ4.3289
8
Generative Speech WatermarkingLibriTTS OOD (test)
STOI0.9789
8
Speaker ErasureLibriTTS 1-speaker setting (forget test)
WER2.57
7
Speaker ErasureLibriTTS 1-speaker setting (retain test)
WER2.47
7
Accented Speech SynthesisLibriTTS-R (train-clean-100)
US Probability73.8
7
Showing 25 of 52 rows