Spoken Question Answering on SQuAD v1.1 (test)
[Chart: EM over time. Top result: 89.24 EM, AEG (with LFE), Mar 17, 2026]
Evaluation Results
| Method | Model | Date | EM |
| --- | --- | --- | --- |
| AEG (with LFE) | Qwen3-Omni-30B-A3B | 2026.03 | 89.24 |
| AEG (w/o LFE) | Qwen3-Omni-30B-A3B | 2026.03 | 88.96 |
| AEG (with LFE) | GPT-4o Audio | 2026.03 | 88.94 |
| baseline | GPT-4o Audio | 2026.03 | 88.49 |
| baseline | Qwen3-Omni-30B-A3B | 2026.03 | 88.37 |
| AEG (with LFE) | LongCat-Flash-Omni | 2026.03 | 87.07 |
| AEG (w/o LFE) | GPT-4o Audio | 2026.03 | 86.94 |
| AEG (w/o LFE) | LongCat-Flash-Omni | 2026.03 | 86.61 |
| baseline | LongCat-Flash-Omni | 2026.03 | 84.32 |
| AEG (with LFE) | Qwen3-Omni Flash | 2026.03 | 80.92 |
| AEG (w/o LFE) | Qwen3-Omni Flash | 2026.03 | 80.36 |
| baseline | Qwen3-Omni Flash | 2026.03 | 79.51 |