| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Audio Classification | GTZAN | Accuracy90.4 | 54 | |
| Classification | GTZAN (test) | Accuracy82.1 | 23 | |
| Music genre and Speech vs Music classification | GTZAN | Genre Accuracy86.8 | 22 | |
| Music Genre Classification | GTZAN | Accuracy89.8 | 19 | |
| Audio Classification | GTzan 10 | Top-1 Accuracy74.1 | 15 | |
| Music Genre Classification | GTZAN (test) | Accuracy96 | 8 | |
| Beat Tracking | GTZAN | F-measure80.64 | 8 | |
| Beat Tracking | GTZAN 1000 excerpts | F-Measure76.48 | 7 | |
| Downbeat Tracking | GTZAN (test) | F1 Score78.3 | 5 | |
| Beat Tracking | GTZAN (test) | F1 Score89.1 | 5 | |
| Music Genre Classification | GTZAN (10-fold cross val) | Accuracy93.9 | 5 | |
| Continual Learning | GTZAN (5-session split) | Accuracy78 | 4 | |
| Audio Classification | GTZAN Genres zero-shot | Zero-Shot Accuracy71 | 4 | |
| Downbeat Tracking | GTZAN | F1 Score54.07 | 3 | |
| Downbeat Tracking | GTZAN 1000 excerpts (Entire dataset) | F-Measure46.49 | 2 | |
| Audio Signal Representation | GTZAN (test) | Metric- | 0 |