Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Audio Question Answering on WenetSpeech-QA
Loading...
48.47
Audio-QA Score
GLM-4-Voice
-1.2108
11.6871
24.585
37.4829
Sep 24, 2025
Audio-QA Score
Updated 23d ago
Evaluation Results
Method
Method
Links
Audio-QA Score
GLM-4-Voice
Size=9B
2025.09
48.47
Kimi-Audio
Size=7B
2025.09
43.2
VITA-Audio
Size=7B
2025.09
30.75
LLaMA-Omni
Size=8B
2025.09
30.28
SpeechGPT
Size=7B
2025.09
24.53
Pretrain+TtT
Size=3B
2025.09
21.43
Moshi
Size=7B
2025.09
16.85
TtT
Size=3B
2025.09
11.61
SLAM-Omni
Size=0.5B
2025.09
7.9
Mini-Omni
Size=0.5B
2025.09
2.42
Qwen2.5-3B (AR)
Size=3B
2025.09
0.7
Qwen2.5-3B (NAR)
Size=3B
2025.09
0.7
Feedback
Search any
task
Search any
task