Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

English

Benchmarks

Task NameDataset NameSOTA ResultTrend
Logical ReasoningEnglish
BPA98.8
24
Text-To-SpeechEnglish (test)
WER0.0165
21
General Language EvaluationEnglish lm-evaluation-harness
ARC Easy Acc (Norm)0.819
16
Dependency ParsingEnglish (en) (test)
LAS95.33
16
Incremental BPE TokenizationEnglish
Median End-to-end CPU Time (s)0.925
15
Unsupervised Constituency ParsingEnglish SPMRL (test)
S-F169.7
15
BPE TokenizationEnglish
Speedup Factor3.13
12
Implicit Discourse Relation classificationEnglish (test)
Precision62
12
TokenizationEnglish eng
NSL Score1.27
10
Morphological AlignmentEnglish 300 MB Corpora
Morph. Score64.4
9
Bias-Penalized Accuracy EvaluationEnglish
Bias-Penalized Accuracy (BPA)98.78
9
Tokenization EfficiencyEnglish
Bytes per Token3.65
6
Speech-to-Singing conversionEnglish (test)
LSD2.512
6
RST ParsingEnglish
Span Score88.2
6
audio-driven facial animationEnglish (test)
MSE0.01
5
Dysarthria DetectionEnglish
Accuracy96.57
5
Alzheimer's DetectionEnglish subset
Accuracy96.73
5
Depression DetectionEnglish
Accuracy97.04
5
Speech Intelligibility AssessmentEnglish
Absolute Kendall's Tau0.768
5
Speaker DiarizationEnglish
DER10.272
5
Simple Definition GenerationEnglish (test)
BLEU15.05
5
Named Entity RecognitionEnglish (test)
F1 Score91.05
5
Token FertilityEnglish
Fertility Score1.24
4
Singing Voice SynthesisEnglish SVS
WER17.89
4
Zero-shot Text-to-SpeechEnglish Speech Emotion Prompt
WER0.0194
4
Showing 25 of 34 rows