Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AudioSet

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio ClassificationAudioSet 20K
mAP47.8
147
Anti-steganalysisAudioset
P_E49.94
99
Audio ClassificationAudioSet 2M
mAP50.5
98
Audio ReconstructionAudioSet (eval)
Mel Distance0.382
63
1D audio reconstructionAudioSet
NMSE0.006
63
Audio ClassificationAudioSet
mAP49.6
60
ClassificationAudioSet (test)
mAP49.6
57
Audio Event TaggingAudioSet AS-2M (full)
mAP50.2
45
Sound ClassificationAudioSet (evaluation)
mAP47.1
39
Acoustic event detectionAudioSet (test)
mAP0.462
34
Audio ClassificationAudioSet-2M (full)
mAP48.6
32
Audio TaggingAudioSet (test)
mAP50
25
Audio Event TaggingAudioSet (AS-20K)
mAP46.7
24
Audio ReconstructionAudioSet (test)
Mel Distance (16kHz)0.32
23
Audio ClassificationAudioSet Full (test)
mAP45.9
23
ClassificationAudioSet AS-2M
mAP (%)50.2
21
Audio ClassificationAudioSet 20k (train test)
mAP31.67
19
Generalized Zero-Shot Retrieval (Text-to-Audio)AudioSet ZSL (test)
mAP (S)72.25
19
Sound Event DetectionAudioSet Strongly-labeled (test)
PSDS1 (w/o var-pen)0.374
18
Audio-visual event classificationAudioSet 2M
mAP (Audio-only)49.1
16
Generalized Zero-Shot ClassificationAudioSet ZSL (test)
mAcc (Seen)50.96
16
Audio GenerationAudioSet AAR 20k
Minimum LSD0
15
Autoregressive audio (AAR)AudioSet 20k (subset of 100 random 10 s clips)
Compression Ratio2.75
15
Multi-class Music ClassificationAudioSet
Accuracy58.71
14
Audio TaggingAudioSet balanced (AS-20k)
mAP40.2
14
Showing 25 of 90 rows