| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Sound Event Detection | DCASE HEAR challenge | Onset FMS 89.1 | 20 |
| Sound Event Detection | DCASE challenge task 4 2023 (test) | PSDS1 0.587 | 15 |
| Sound classification | DCASE | Accuracy 97 | 15 |
| Anomaly Detection | DCASE RMIS Benchmark | DCASE 20 Score 74.2 | 14 |
| Acoustic Scene Classification | DCASE task 1a 2020 (dev) | Overall Accuracy 81.9 | 13 |
| Sound Event Detection | DCASE DESED Task 4 2024 | tPSDS@1 0.583 | 12 |
| Audio event classification | DCASE (official) | Top-1 Accuracy 95 | 9 |
| Sound recognition | DCASE 2014 (test) | Top-1 Accuracy 96 | 8 |
| Environmental Sound Classification | DCASE Task 1 2019 (incremental split (5 tasks)) | Accuracy 58.1 | 6 |
| Sound Event Localization and Detection | DCASE Task 3 Stereo SELD 2025 (test) | F1 Score 40.2 | 6 |
| Anomalous Sound Detection | DCASE Task 2 2020 (test) | Fan AUC 85.88 | 6 |
| Sound Event Detection | DCASE challenge task 4 2023 (val) | PSDS1 0.587 | 6 |
| Audio Captioning | DCASE Task 6 2020 (dev test) | BLEU-1 53.7 | 6 |
| Audio Classification | DCASE 2014 | Accuracy 94 | 6 |
| Audio Tagging | DCASE Task 2 2018 (test) | mAP@3 95.4 | 5 |
| Sound Event Classification | DCASE17 Task 4 (test) | F1 Score 33.8 | 4 |
| Acoustic Scenes | DCASE Task 1a 2019 (test) | Accuracy 76.1 | 3 |