Speech Synthesis on ESD Zh
[Chart: WER by date. Plotted entry: Spark-TTS, WER 2.4, Oct 16, 2025.]
Updated 11d ago
Evaluation Results
| Method | Backbone | Date | WER | SIM-O | CMOS | Emotion MOS | Neutral Score | Happy Score | Sad Score | Angry Score | Surprise Score | Average Score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Spark-TTS | Qwen2.5 | 2025.10 | 2.4 | 0.7 | 6.2 | 5.99 | 82.29 | 5.56 | 0 | 4 | 0 | 18.37 |
| RLAIF-SPA | MiniCPM-O 2.6 | 2025.10 | 3.68 | 0.74 | 6.5 | 6.29 | 89.71 | 7.14 | 4.57 | 4.86 | 3.14 | 21.88 |
| MegaTTS3 | | 2025.10 | 3.86 | 0.72 | 6.4 | 6.28 | 54 | 51.14 | 0 | 0.86 | 0 | 21.2 |
| F5-TTS | | 2025.10 | 4.01 | 0.69 | 6.45 | 6.06 | 87.71 | 0 | 0 | 10.29 | 0.57 | 19.71 |
| Chat-TTS | | 2025.10 | 6.7 | 0.67 | 6.06 | 5.92 | 63.14 | 6.35 | 2.57 | 2.57 | 2.86 | 15.5 |
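
The page does not say how Average Score is computed. The reported values are, however, consistent with an unweighted mean of the five per-emotion scores (Neutral, Happy, Sad, Angry, Surprise). The sketch below checks that reading against the numbers in the results above. The helper names (`mean_emotion_score`, `check_averages`) and the 0.01 rounding tolerance are assumptions made for illustration, not part of the benchmark.

```python
# Per-emotion scores in the order Neutral, Happy, Sad, Angry, Surprise,
# paired with the reported Average Score, copied from the results above.
RESULTS = {
    "Spark-TTS": ([82.29, 5.56, 0, 4, 0], 18.37),
    "RLAIF-SPA": ([89.71, 7.14, 4.57, 4.86, 3.14], 21.88),
    "MegaTTS3": ([54, 51.14, 0, 0.86, 0], 21.2),
    "F5-TTS": ([87.71, 0, 0, 10.29, 0.57], 19.71),
    "Chat-TTS": ([63.14, 6.35, 2.57, 2.57, 2.86], 15.5),
}


def mean_emotion_score(scores):
    """Unweighted mean of the per-emotion scores (assumed formula)."""
    return sum(scores) / len(scores)


def check_averages(results, tol=0.01):
    """Compare the computed mean with the reported Average Score.

    Returns {method: (computed, reported, matches)}, where `matches` is True
    when the two values differ by less than `tol` (an assumed tolerance
    covering two-decimal rounding).
    """
    report = {}
    for method, (scores, reported) in results.items():
        computed = mean_emotion_score(scores)
        report[method] = (computed, reported, abs(computed - reported) < tol)
    return report


if __name__ == "__main__":
    for method, (computed, reported, ok) in check_averages(RESULTS).items():
        status = "consistent" if ok else "differs"
        print(f"{method}: computed {computed:.3f}, reported {reported} ({status})")
```

If this reading is right, a high Average Score can be driven almost entirely by the Neutral Score: F5-TTS, for example, scores 87.71 on Neutral and 0 on both Happy and Sad.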