
HEAR

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Audio Representation Evaluation | HEAR (Holistic Evaluation of Audio Representations) | CREMA-D 76.7 | 35 |
| Scene-based Audio Classification | HEAR Environmental Sound tasks | ESC-50 Accuracy 78.9 | 5 |
| Scene-based Audio Classification | HEAR Speech tasks | CREMA-D Score 0.656 | 5 |
| Audio Scene Classification | HEAR Music 2021 | Beijing 0.966 | 5 |