Speech-to-Text Question Answering on TriviaQA (Accuracy)
Top result: Qwen2-Audio, 38.5 Accuracy
[Chart: Accuracy by date (Jan 30, 2026)]
Updated 4d ago
Evaluation Results
Method            Details                      Date     Accuracy
Qwen2-Audio       Type=AR, S→S capabilit...    2026.01  38.5
DiFFA             Type=Diff., S→S capabi...    2026.01  36
DIFFUSPEECH       Type=Diff., S→S capabi...    2026.01  33.5
Moshi             Type=AR, S→S capabilit...    2026.01  30.5
MinMo             Type=AR, S→S capabilit...    2026.01  25.5
Llama-Omni2       Type=AR, S→S capabilit...    2026.01  23.9
Phi-4-Multimodal  Type=AR, S→S capabilit...    2026.01  22.8
SpeechGPT         Type=AR, S→S capabilit...    2026.01  8.2
SpiritLM          Type=AR, S→S capabilit...    2026.01  4.2