| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Audio Classification | TUT 2017 | Base Accuracy 51.6 | 13 |
| Audio Comprehension | TUT 2017 (test) | Accuracy 0.821 | 8 |
| Audio Classification | TUT 2017 (test) | Accuracy 82.88 | 7 |
| Audio Understanding | TUT 2017 | Accuracy 68.09 | 7 |
| Audio Classification | TUT 2017 | Score 71.2 | 7 |
| Acoustic Scene Classification | TUT 2017 | Accuracy 71.2 | 6 |
| Audio Classification | TUT17 | Accuracy 64.9 | 6 |
| Multiple-choice Audio Question Answering | TUT 2017 (test) | Accuracy 78.4 | 3 |