English

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Logical Reasoning | English | BPA 98.8 | 24 |
| Text-To-Speech | English (test) | WER 0.0165 | 21 |
| Dependency Parsing | English (en) (test) | LAS 95.33 | 16 |
| Unsupervised Constituency Parsing | English SPMRL (test) | S-F1 69.7 | 15 |
| Implicit Discourse Relation classification | English (test) | Precision 62 | 12 |
| Morphological Alignment | English 300 MB Corpora | Morph. Score 64.4 | 9 |
| Bias-Penalized Accuracy Evaluation | English | Bias-Penalized Accuracy (BPA) 98.78 | 9 |
| Speech-to-Singing conversion | English (test) | LSD 2.512 | 6 |
| RST Parsing | English | Span Score 88.2 | 6 |
| Speech Intelligibility Assessment | English | Absolute Kendall's Tau 0.768 | 5 |
| Speaker Diarization | English | DER 10.272 | 5 |
| Simple Definition Generation | English (test) | BLEU 15.05 | 5 |
| Named Entity Recognition | English (test) | F1 Score 91.05 | 5 |
| Zero-shot Text-to-Speech | English Speech Emotion Prompt | WER 0.0194 | 4 |
| Handwriting Generation | English (test) | Content Score 0.8552 | 4 |
| Tokenization | English Reasoning | Average Tokens per Sample 6,192.77 | 3 |
| Tokenization | English General | Avg Tokens per Sample 794.79 | 3 |
| Language Modeling | English Tail (test) | Relative P95 RTF Reduction 4.66 | 3 |
| Language Modeling | English VA (test) | Relative P95 RTF Reduction -23.79 | 3 |
| Vector Font Reconstruction | English EN (test) | Error 5.2 | 3 |
| Complex Definition Generation | English (test) | BLEU 24.17 | 3 |
| General Language Evaluation | English lm-evaluation-harness | AGIEval Acc (Norm) 0.259 | 2 |