Text-to-Speech on LibriSpeech 2.2 hours subset (test-clean)
[Chart: WER by method. Top entry: Voicebox, WER 0.019 (Oct 9, 2024). Metrics: WER, SIM-o, RTF]
Updated 4d ago
Evaluation Results
Method        Details                      Date     WER     SIM-o  RTF
Voicebox      #Param.=330M, #Data=60...    2024.10  0.019   0.662  0.64
MELLE         #Data=50K EN                 2024.10  0.021   0.625  0.538
MELLE-R2      #Data=50K EN                 2024.10  0.0214  0.608  0.276
Ground Truth  Subset=2.2 hours subset      2024.10  0.022   0.754  -
VALL-E 2      #Data=50K EN                 2024.10  0.0244  0.643  0.732
DiTTO-TTS     #Param.=740M, #Data=55...    2024.10  0.0256  0.627  0.162
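
For readers unfamiliar with the metrics, the sketch below shows the standard definitions of WER (word-level edit distance divided by the number of reference words) and RTF (synthesis time divided by the duration of the generated audio). It is a minimal illustration, not the evaluation pipeline behind this leaderboard: the function names, the lowercase-and-split text normalization, and the example sentences are assumptions. The values in the table appear to be fractions rather than percentages, so 0.019 would correspond to 1.9% WER.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words.

    Assumption: text is normalized only by lowercasing and whitespace splitting;
    a real evaluation would typically apply a stricter normalizer.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Time spent synthesizing divided by the duration of the audio produced."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


if __name__ == "__main__":
    # Hypothetical example: one substituted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
    # Hypothetical example: 2 s of compute for 8 s of audio.
    print(real_time_factor(2.0, 8.0))
```

Under these definitions lower is better for both WER and RTF, and an RTF below 1 means synthesis runs faster than real time.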