| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| ESC-50 | CALM | Accuracy99.25 | 441 | 2d ago | |
| AudioSet 20K | CAT | mAP47.8 | 147 | 19d ago | |
| Urbansound8K | ConformerXL-P Non-RA | Accuracy97.5 | 126 | 2mo ago | |
| AudioSet 2M | MiCo | mAP50.5 | 98 | 19d ago | |
| ESC-50 (test) | M2D2 | Accuracy98.5 | 87 | 2mo ago | |
| VGG-Sound | CALM | Top-1 Accuracy88.15 | 83 | 2mo ago | |
| SPC V2 | SupMAM-CLAP | Accuracy98.7 | 65 | 3mo ago | |
| ESC50 | XKD | Top-1 Acc96.5 | 64 | 3mo ago | |
| AudioSet | mAP49.6 | 60 | 15d ago | ||
| GTZAN | Nystromformer | Accuracy94.66 | 59 | 2mo ago | |
| Speech Commands (test) | Chimera | Accuracy98.4 | 49 | 26d ago | |
| Speech Commands V2 (test) | ASiT | Accuracy98.9 | 46 | 2mo ago | |
| SHD (test) | DelRec* | Accuracy91.72 | 41 | 1mo ago | |
| US8K (test) | EAT (LoRA) + Joint Training | R@1 Accuracy0.9807 | 41 | 3mo ago | |
| Beijing-Opera | CoOp + SEPT | Base Accuracy97.88 | 34 | 2mo ago | |
| AudioSet-2M (full) | BEATSiter3+ | mAP48.6 | 32 | 3mo ago | |
| ESC50 (test) | MUKA | R@1 Accuracy0.9803 | 28 | 3mo ago | |
| VocalSound | SAM-2.7B | Accuracy72.2 | 26 | 14d ago | |
| Speech Commands Spectrogram Mini (train) | Training Loss0.219 | 24 | 3mo ago | ||
| Speech Commands Spectrogram Mini (val) | Accuracy83.3 | 24 | 3mo ago | ||
| Vocal Sound | Score92.4 | 23 | 1mo ago | ||
| AudioSet Full (test) | AST (w/o ensemble) | mAP45.9 | 23 | 3mo ago | |
| OpenMIC | BEATs iter3+ | mAP86.7 | 22 | 1mo ago | |
| AudioSet 20k (train test) | protobin | mAP31.67 | 19 | 2d ago | |
| US8K | MATPAC++ | Top-1 Accuracy89.7 | 19 | 1mo ago |