Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Speech Recognition on LongSpeech
Loading...
11
WER
Upper-bound
9.392
20.246
31.1
41.954
Feb 5, 2026
WER
Updated 4d ago
Evaluation Results
Method
Method
Links
WER
Upper-bound
compression=uncompressed
2026.02
11
Speech-XL
compression=SST mechanism
2026.02
11.4
Whisper
mode=zero-shot
2026.02
17
Voxtral
mode=zero-shot
2026.02
27.5
Qwen2-Audio
mode=zero-shot
2026.02
29.9
AudioFlamingo3
mode=zero-shot
2026.02
34.5
DashengLM
mode=zero-shot
2026.02
35.5
Kimi-audio
mode=zero-shot
2026.02
51.2
Feedback
Search any
task
Search any
task