Spoken Dialogue Generation on TTSD EN (test)
[Chart: Accuracy over time. Plotted point: MOSS-TTSD, Accuracy 96.55, Mar 20, 2026. Selectable metrics: Accuracy, Similarity, Word Error Rate.]
Updated 26d ago
Evaluation Results
| Method | Configuration | Date | Accuracy | Similarity | Word Error Rate |
|---|---|---|---|---|---|
| MOSS-TTSD | Reference Speaker Timb... | 2026.03 | 96.55 | 78.93 | 9.84 |
| MOSS-TTSD | Context=Open-Source Mo... | 2026.03 | 96.26 | 73.26 | 9.88 |
| MOSS-TTSD | Reference Speaker Timb... | 2026.03 | 95.65 | 73.04 | 10.05 |
| VibeVoice | Model Size=7B | 2026.03 | 95.54 | 71.4 | 9.46 |
| gemini-2.5-pro-preview-tts | | 2026.03 | 95.37 | 67.86 | 8.59 |
| gemini-2.5-flash-preview-tts | | 2026.03 | 95.11 | 71.94 | 8.71 |
| Eleven V3 | | 2026.03 | 94.98 | 67.3 | 8.24 |
| VibeVoice | Model Size=1.5B | 2026.03 | 93.53 | 69.61 | 11.33 |
| Higgs Audio V2 | | 2026.03 | 90.25 | 68.6 | 21.31 |