| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Modeling | PTB | Perplexity8.159 | 650 | |
| Language Modeling | PTB (test) | Perplexity8.159 | 471 | |
| Imputation | PTB | PRD4.16 | 162 | |
| Constituent parsing | PTB (test) | F196.48 | 127 | |
| Language Modeling | PTB (val) | Perplexity36.14 | 83 | |
| Unsupervised Parsing | PTB (test) | F1 Score83.3 | 75 | |
| Open-ended text generation | PTB | COH Score67.7 | 64 | |
| Phrase-structure parsing | PTB (§23) | F1 Score95.8 | 56 | |
| Character-level Prediction | PTB (test) | BPC (Test)1.045 | 42 | |
| Grammar Induction | PTB English (test) | F1 Score84.3 | 29 | |
| Dependency Parsing | PTB | UAS97.4 | 24 | |
| POS tagging | PTB (test) | Accuracy97.78 | 24 | |
| Language Modeling | PTB zero-shot | Perplexity82.05 | 23 | |
| Word-level prediction | PTB word-level (test) | Perplexity72.8 | 19 | |
| Parsing | PTB (test) | Sents/sec1,092 | 17 | |
| Language Modeling | PTB English Mikolov preprocessed (val) | Perplexity44.9 | 13 | |
| Constituency Parsing | PTB Section 23 (test) | Rerank F10.9513 | 13 | |
| ECG Classification | PTB database | Accuracy99.43 | 13 | |
| Uncertainty Quantification | PTB | CU417 | 12 | |
| Membership Inference Attack | PTB | AUC59.28 | 12 | |
| Constituency Parsing | PTB (test) | Speed (Sents/s)1,127 | 12 | |
| Medical Time Series Classification | PTB-XL 5-Classes (test) | Accuracy0.7353 | 11 | |
| Medical Time Series Classification | PTB 2-Classes (test) | Accuracy0.8596 | 11 | |
| Text Generation | PTB (test) | Grammaticality0.447 | 10 | |
| segment-level identification | PTB closed-set segment-level (test) | Accuracy99.1678 | 8 |