Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Spoken Question Answering on MuSiQue v1.0 (test)
Loading...
53.99
Exact Match (EM)
AEG (with LFE)
44.3492
46.8521
49.355
51.8579
Mar 17, 2026
Exact Match (EM)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Exact Match (EM)
AEG (with LFE)
Model=GPT-4o Audio
2026.03
53.99
AEG (with LFE)
Model=LongCat-Flash-Omni
2026.03
53.99
AEG (w/o LFE)
Model=GPT-4o Audio
2026.03
53.7
AEG (w/o LFE)
Model=LongCat-Flash-Omni
2026.03
52.38
baseline
Model=GPT-4o Audio
2026.03
51.51
baseline
Model=LongCat-Flash-Omni
2026.03
49.57
AEG (with LFE)
Model=Qwen3-Omni-30B-A3B
2026.03
48.61
AEG (w/o LFE)
Model=Qwen3-Omni-30B-A3B
2026.03
47.62
baseline
Model=Qwen3-Omni-30B-A3B
2026.03
45.88
AEG (with LFE)
Model=Qwen3-Omni Flash
2026.03
45.72
AEG (w/o LFE)
Model=Qwen3-Omni Flash
2026.03
44.93
baseline
Model=Qwen3-Omni Flash
2026.03
44.72
Feedback
Search any
task
Search any
task