| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Sound Foundation | AIR-Bench 1.0 (test) | Score 65.1 | 13 |
| Safety | AIR-Bench | Average Score 0.66 | 12 |
| Paralinguistic speech understanding | AIR-Bench Speech (test) | Emotion Acc 71.45 | 11 |
| Chat Benchmark | AIR-Bench | Score (Speech Domain) 7.54 | 11 |
| Retrieval | AIR-Bench English 24.04 | Wiki Score 65.5 | 10 |
| Question Answering | AIR-Bench Foundation | Accuracy 36.8 | 8 |
| Content Moderation | AIR-Bench Text + Image (test) | Precision 83 | 8 |
| Content Moderation | AIR-Bench Image Only (test) | Precision 94 | 8 |
| Content Moderation | AIR-Bench Text Only (test) | Precision 94 | 8 |
| Music Foundation Tasks | AIR-Bench Music 1.0 (test) | Inst. Classification Acc 65.8 | 7 |
| Speech Foundation | AIR-Bench Speech Foundation | Speech Grounding 5,920 | 7 |
| Speech Chat | AIR-Bench 1.0 (test) | Overall Score 7.18 | 7 |
| Gender Classification | Air-Bench | Accuracy 0.905 | 6 |
| Open-Ended Audio Understanding | AIR-Bench chat | AIR-Bench Chat Score 6.8 | 3 |