Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Speech-to-Text Question Answering on OBQA
Loading...
83.08
Accuracy
APin
9.5728
28.6564
47.74
66.8236
Jan 30, 2026
Feb 10, 2026
Feb 21, 2026
Mar 5, 2026
Mar 16, 2026
Mar 27, 2026
Apr 8, 2026
Accuracy
Updated 9d ago
Evaluation Results
Method
Method
Links
Accuracy
APin
Setting=Aggressive (τi...
2026.04
83.08
APin
Setting=Conservative (...
2026.04
82.64
APdeep
Setting=Conservative (...
2026.04
82.64
DAP
Setting=Aggressive (τi...
2026.04
82.42
APdeep
Setting=Aggressive (τi...
2026.04
82.2
Vanilla
lin=-, ldeep=-, FRR=10...
2026.04
81.98
DAP
Setting=Conservative (...
2026.04
81.98
Phi-4-Multimodal
Type=AR, S→S capabilit...
2026.01
65.9
DIFFUSPEECH
Type=Diff., S→S capabi...
2026.01
51.3
Qwen2-Audio
Type=AR, S→S capabilit...
2026.01
49.5
MinMo
Type=AR, S→S capabilit...
2026.01
44.5
DiFFA
Type=Diff., S→S capabi...
2026.01
35.6
Llama-Omni2
Type=AR, S→S capabilit...
2026.01
28.1
Moshi
Type=AR, S→S capabilit...
2026.01
26
SpiritLM
Type=AR, S→S capabilit...
2026.01
21.7
SpeechGPT
Type=AR, S→S capabilit...
2026.01
12.4
Feedback
Search any
task
Search any
task