| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio Understanding | MMAR (test) | Performance67.1 | 20 | |
| Audio Reasoning | MMAR (test) | Sound Score61.4 | 17 | |
| Audio Question Answering | MMAR | Sd Score68.48 | 17 | |
| Audio Understanding | MMAR (comprehensive evaluation) | Sound Score61.21 | 15 | |
| Audio Understanding / Audio Reasoning | MMAR | Accuracy61.4 | 13 | |
| Audio Understanding | MMAR | MMAR74.7 | 12 | |
| Audio Reasoning | MMAR | Sound Accuracy73.33 | 8 | |
| Audio Reasoning | MMAR N=1,000 | Accuracy53.6 | 5 | |
| Audio Understanding & Reasoning | MMAR | Score71.9 | 3 | |
| Dense Audio Captioning | MMAR | MMAR Score46.4 | 3 | |
| Audio QA | MMAR (test) | Accuracy68.1 | 2 |