Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Speech Generation on Long-Audio benchmark English
Loading...
4.38
WER
Fish Audio S2
3.4352
9.8126
16.19
22.5674
Mar 9, 2026
WER
SIM-Mean
SIM-Std
Updated 1mo ago
Evaluation Results
Method
Method
Links
WER
SIM-Mean
SIM-Std
Fish Audio S2
2026.03
4.38
0.523
0.0761
Fish Audio S1
2026.03
6.26
0.436
0.108
Qwen3-TTS
2026.03
7.69
0.39
0.0737
VibeVoice
2026.03
28
0.53
0.0572
Feedback
Search any
task
Search any
task