Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Audio

Benchmarks

Task NameDataset NameSOTA ResultTrend
Temporal AttributionAudio
I(100)74.3
13
Negative temporal attributionAudio
Δŷc(2%)-0.25
13
Analysis-synthesisAudio Industrial
FAD0
12
Audio Compression Quality AssessmentAudio 24kHz
Speech Quality Score94.65
12
Boolean Matrix Factorization Completionaudio missing entries
Objective Value Improvement41
9
Audio CodingAudio 16kHz 22kHz (test)
Bitrate (kbps)0.7
8
Generation Success RateAudio suite (test)
Gauss Success Rate75.22
6
Inference SpeedAudio 120s (test)
Inference Time (ms)479
5
Inference SpeedAudio 60s (test)
Inference Time (ms)244
5
Inference SpeedAudio 30s (test)
Inference Time (ms)125
5
Inference SpeedAudio 10s (test)
Inference Time (ms)42
5
Audio Quality EvaluationAudio Evaluation Set
ESTOI43
5
Density EstimationAudio Twenty Datasets (test)
Log-LH-39.74
4
Text-to-Audio Classificationaudio 2024 (test)
Species Top-1 Accuracy27.7
2
Audio-to-Text Classificationaudio 2024 (test)
Species Top-1 Accuracy24.4
2
Audio Reconstruction Quality48 kHz audio (test)
STOI0.996
1
Showing 16 of 16 rows