| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MMAR (test) | Gemini 2.5 Pro | Average Score74.4 | 57 | 1mo ago | |
| MMAR | Average Accuracy83.7 | 38 | 1d ago | ||
| MMAU-Pro | AudioToolAgent | Average Score61.9 | 18 | 1mo ago | |
| MMAU mini (test) | Omni-R1 | Average Score77.7 | 17 | 1mo ago | |
| MMAU mini 1.0 (test) | Step-Audio 2 | Sound Score83.48 | 15 | 3mo ago | |
| MMAU v05.15.25 (base) | SAM+OR-780M | Sound Accuracy59.73 | 10 | 2mo ago | |
| MMAU v05.15.25 (mini) | SAM+OR-2.7B | Sound Score61.86 | 10 | 2mo ago | |
| SoundMind | Step-Audio-R1 | Accuracy69.8 | 9 | 13d ago | |
| SAKURA | Qwen2-Audio-Instruct | Single Score81.2 | 8 | 2mo ago | |
| MMAR Agent Track | Accuracy77.4 | 8 | 2mo ago | ||
| MMAU | Qwen3-Omni | Accuracy76.5 | 7 | 1mo ago | |
| MMSU | OmniJigsaw | Accuracy (Audio Reasoning)70.7 | 7 | 1mo ago | |
| MMAR N=1,000 | CoFi-Agent | Accuracy53.6 | 5 | 3mo ago | |
| CompA R (test) | AF-Next-Instruct | Accuracy98.7 | 2 | 1mo ago |