Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PTB

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingPTB
Perplexity4.345
1,234
Language ModelingPTB (test)
Perplexity8.159
543
ImputationPTB
PRD4.16
162
Constituent parsingPTB (test)
F196.48
155
Language ModelingPTB (val)
Perplexity20.26
107
Unsupervised ParsingPTB (test)
F1 Score83.3
75
Open-ended text generationPTB
COH Score67.7
64
Phrase-structure parsingPTB (§23)
F1 Score95.8
56
ECG ClassificationPTB-XL (test)
AUC97.7
46
Character-level PredictionPTB (test)
BPC (Test)1.045
42
Dependency ParsingPTB
LAS96.3
31
Grammar InductionPTB English (test)
F1 Score84.3
29
Time-series ClassificationPTB 2 classes
Accuracy99.9
26
ECG ClassificationPTB-XL
AUROC96
26
Language ModelingPTB zero-shot
Perplexity82.05
25
POS taggingPTB (test)
Accuracy97.78
24
Word-level predictionPTB word-level (test)
Perplexity72.8
19
ParsingPTB (test)
Sents/sec1,092
17
2-class ECG classificationPTB ECG-2 (Cross-subject)
Accuracy87.73
13
Language ModelingPTB English Mikolov preprocessed (val)
Perplexity44.9
13
Constituency ParsingPTB Section 23 (test)
Rerank F10.9513
13
ECG ClassificationPTB database
Accuracy99.43
13
Medical Time Series ClassificationPTB 2-Classes
Accuracy85.39
12
Conditional GenerationPTB
Perplexity41.2
12
Uncertainty QuantificationPTB
CU417
12
Showing 25 of 65 rows