Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TIMIT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Phoneme RecognitionTIMIT (test)
PER8.3
31
Speech EnhancementTIMIT Baby-cry noise
PESQ2.277
24
Speech EnhancementTIMIT Cafeteria noise
PESQ2.458
24
Speech EnhancementTIMIT Crowd-party noise
PESQ2.447
24
Speech EnhancementTIMIT Helicopter noise
PESQ2.677
24
Phone recognitionTIMIT (test)
Frame Error Rate17.3
23
Phoneme RecognitionTIMIT (dev)
PER7.4
20
Phoneme RecognitionTIMIT core (test)
PER10.3
20
Audio ClassificationTIMIT 3 (test)
Average Top-1 Acc95.22
18
Log-magnitude STFT predictionTIMIT 8kHz (val)
MSE14.41
15
Spoken Term DetectionTIMIT OOV
MTWV @ -5dB SNR0.07
14
Spoken Term DetectionTIMIT (IV)
MTWV (-5dB)0.03
14
Speech predictionTIMIT (test)
MSE2.76
13
Speech predictionTIMIT (val)
MSE2.86
13
Phone recognitionTIMIT (dev)
Frame Error Rate28.5
12
Log-magnitude STFT predictionTIMIT 8kHz (evaluation)
MSE14.45
11
Automatic Speech RecognitionTIMIT (test)
Accuracy85.3
10
Speech RecognitionTIMIT (test)
PER0.209
7
Voice conversionTIMIT OOD
F0 Correlation0.484
6
Phoneme RecognitionTIMIT core (dev)
PER9.1
6
Online Speech RecognitionTIMIT (test)
PER0.196
6
Phone boundary detectionTIMIT non-speech removed (test)
Precision85.31
4
Speech RecognitionTIMIT
Accuracy68.9
4
Speaker IdentificationTIMIT 462 speakers (test)
CER85
4
Articulatory Feature DetectionTIMIT (test)
Anterior Feature Accuracy0.94
4
Showing 25 of 39 rows