| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Modeling | WikiText-103 (test) | Perplexity2.22 | 579 | |
| Language Modeling | WikiText-103 (val) | PPL1.01 | 214 | |
| Language Modeling | WikiText-103 | PPL4.59 | 189 | |
| Word-level Language Modeling | WikiText-103 word-level (test) | Perplexity15.79 | 65 | |
| Word-level language modeling | WikiText-103 (dev) | Perplexity15.72 | 64 | |
| Language Modeling | Wikitext-103 | PPL3.14 | 42 | |
| Text Continuation | WikiText-103 512-token continuation (test) | Perplexity (PPL)1 | 35 | |
| Language Modeling | WikiText-103 zero-shot (test) | PPL12.76 | 34 | |
| Open-ended generation | Wikitext-103 (test) | MAUVE0.96 | 26 | |
| Language Generation | WikiText-103 | Perplexity (PPL)1 | 25 | |
| Tokenization | WikiText-103 | Latency (ms)1.92 | 25 | |
| Language Modeling | WikiText-103 | Delta PPL0 | 25 | |
| Text generation | Wikitext-103 | Perplexity32.88 | 23 | |
| Steganographic secret extraction | WikiText-103 W (test) | Accuracy75 | 20 | |
| Text Generation | WikiText-103 | Quality Better Count24 | 14 | |
| Membership Inference Attack | WikiText-103 | AUC0.784 | 14 | |
| Membership Inference Attack | WikiText-103 (test) | AUC0.782 | 13 | |
| Open-ended text generation | Wikitext-103 v1 | Diversity98.7 | 11 | |
| Language Modeling | WikiText-103 | Loss3.156 | 10 | |
| Open-ended Text Generation | Wikitext-103 | PPL2.55 | 10 | |
| Language Modeling | WikiText-103 small setting (test) | Perplexity32.8 | 10 | |
| Language Modeling | WikiText-103 small setting (val) | Perplexity31.8 | 10 | |
| Privacy-Preserving Text Generation | Wikitext-103 v1 | Cosine Similarity0.627 | 9 | |
| Autoregressive Language Modeling | Wikitext-103 | PPL18.5 | 9 | |
| Data Attribution | WikiText-103 (test) | Tail-patch Score7.88 | 9 |