Text-to-Speech on ELLA-V hard sentences (test)
[Chart: WER (%) over time. Best result: 4.29 WER (%), E1 TTS DMD, Oct 9, 2024. Selectable metrics: WER (%), Substitution Rate (%), Deletion Rate (%), Insertion Rate (%).]
Evaluation Results
| Method | Configuration | Date | WER (%) | Substitution Rate (%) | Deletion Rate (%) | Insertion Rate (%) |
|---|---|---|---|---|---|---|
| E1 TTS DMD | mode=one-step TTS, sou... | 2024.10 | 4.29 | 1.89 | 1.62 | 0.74 |
| F5-TTS | mode=zero-shot, NFE=32... | 2024.10 | 4.4 | 1.81 | 2.4 | 0.18 |
| StyleTTS 2 | mode=zero-shot, source... | 2024.10 | 4.83 | 2.17 | 2.03 | 0.61 |
| CosyVoice | mode=zero-shot, source... | 2024.10 | 8.3 | 3.47 | 2.74 | 1.93 |
| E2 TTS | mode=zero-shot, reprod... | 2024.10 | 8.58 | 3.7 | 4.82 | 0.06 |