Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text-to-Speech on Common Voice en 15
Loading...
10.8
WER
Phi-4 Multimodal
10.228
14.089
17.95
21.811
Jan 15, 2026
WER
Updated 4d ago
Evaluation Results
Method
Method
Links
WER
Phi-4 Multimodal
2026.01
10.8
MoST
2026.01
11.5
SeamlessM4T-v2
2026.01
12.1
MinMo
2026.01
13.5
Moshi
2026.01
14.2
LLaMA-Omni2
2026.01
17.2
SpiritLM
2026.01
22.4
SpeechGPT
2026.01
23.2
AudioLM
2026.01
25.1
Feedback
Search any
task
Search any
task