Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Spoken Instruction Following on OOD Spoken Instruction Suite (test)
Loading...
96.6
Consistency
VIRBA
80.584
84.742
88.9
93.058
Aug 25, 2025
Consistency
Updated 13d ago
Evaluation Results
Method
Method
Links
Consistency
VIRBA
Backbone=Qwen2.5-Omni
2025.08
96.6
VIRBA
Backbone=Qwen2-Audio
2025.08
95.8
Step-Audio-R1
2025.08
93.9
Qwen2.5-Omni
2025.08
93.2
Single-view RL
2025.08
90.6
TTS-SFT
2025.08
88.4
Qwen2-Audio base
2025.08
81.2
Feedback
Search any
task
Search any
task