| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio Reasoning | MMAR (test) | Sound Score67.3 | 38 | |
| Audio Question Answering | MMAR | Average Score70.03 | 35 | |
| Audio Understanding | MMAR (comprehensive evaluation) | Sound Score62.4 | 25 | |
| Multimodal Audio Reasoning | MMAR | Mean Score63.5 | 22 | |
| Audio Understanding | MMAR (test) | Performance67.1 | 20 | |
| Audio Reasoning | MMAR | Average Accuracy77 | 15 | |
| Audio Perception and Reasoning | MMAR within CAFE framework (overall) | Perception Accuracy63.51 | 13 | |
| Audio Understanding / Audio Reasoning | MMAR | Accuracy61.4 | 13 | |
| Audio Understanding | MMAR | MMAR74.7 | 12 | |
| Audio Reasoning | MMAR Agent Track | Accuracy77.4 | 8 | |
| Audio Reasoning | MMAR N=1,000 | Accuracy53.6 | 5 | |
| Audio Understanding & Reasoning | MMAR | Score71.9 | 3 | |
| Dense Audio Captioning | MMAR | MMAR Score46.4 | 3 | |
| Audio QA | MMAR (test) | Accuracy68.1 | 2 |