| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Modeling | WikiText-103 (test) | Perplexity2.22 | 524 | |
| Language Modeling | WikiText-103 (val) | PPL1.01 | 180 | |
| Language Modeling | WikiText-103 | PPL4.59 | 146 | |
| Word-level Language Modeling | WikiText-103 word-level (test) | Perplexity15.79 | 65 | |
| Word-level language modeling | WikiText-103 (dev) | Perplexity15.72 | 64 | |
| Language Modeling | Wikitext-103 | PPL3.14 | 42 | |
| Language Modeling | WikiText-103 zero-shot (test) | PPL12.76 | 34 | |
| Open-ended generation | Wikitext-103 (test) | MAUVE0.96 | 26 | |
| Steganographic secret extraction | WikiText-103 W (test) | Accuracy75 | 20 | |
| Language Modeling | WikiText-103 | Delta PPL0.05 | 16 | |
| Text Generation | WikiText-103 | Quality Better Count24 | 14 | |
| Membership Inference Attack | WikiText-103 | AUC0.784 | 14 | |
| Membership Inference Attack | WikiText-103 (test) | AUC0.782 | 13 | |
| Open-ended Text Generation | Wikitext-103 | PPL2.55 | 10 | |
| Language Modeling | WikiText-103 small setting (test) | Perplexity32.8 | 10 | |
| Language Modeling | WikiText-103 small setting (val) | Perplexity31.8 | 10 | |
| Autoregressive Language Modeling | Wikitext-103 | PPL18.5 | 9 | |
| Data Attribution | WikiText-103 (test) | Tail-patch Score7.88 | 9 | |
| Data Attribution | WikiText-103 | LDS18.33 | 8 | |
| Language Modeling | WikiText-103 (dev) | Perplexity17.9 | 8 | |
| Language Modeling | WikiText-103 | PPL6.88 | 8 | |
| Document Continuation | WikiText-103 (test) | MAUVE13.96 | 8 | |
| Repetition reduction | WikiText-103 (test) | PPL23.82 | 8 | |
| Open-ended text generation | Wikitext-103 (test) | Win Rate84 | 8 | |
| Autoregressive language modeling | WikiText-103 1024 (test) | PPL19 | 7 |