
HEAR

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Audio Representation Evaluation | HEAR (Holistic Evaluation of Audio Representations) | HEAR Average: 83.41 | 47
Scene-based Audio Classification | HEAR Environmental Sound tasks | ESC-50 Accuracy: 78.9 | 5
Scene-based Audio Classification | HEAR Speech tasks | CREMA-D Score: 0.656 | 5
Audio Scene Classification | HEAR Music 2021 | Beijing: 0.966 | 5
Logistic Regression | Hear raw (test) | Wasserstein Distance: 0.644 | 4
Logistic Regression | Hear standardized (test) | Wasserstein Distance (W): 0.514 | 4