| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WikiText2 | Perplexity2.86 | 2,839 | 2d ago | ||
| WikiText-2 (test) | PPL2.56 | 1,949 | 2d ago | ||
| WikiText-2 | TCN-SEQ-I | Perplexity (PPL)1.61 | 1,624 | 2d ago | |
| C4 | Llama2-7B | Perplexity4.77 | 1,422 | 2d ago | |
| C4 | Wanda | Perplexity1 | 1,071 | 3d ago | |
| PTB | Perplexity4.345 | 1,034 | 2d ago | ||
| WikiText | Llama 3.1-70B | PPL0.2838 | 732 | 3d ago | |
| WikiText-103 (test) | RETRO | Perplexity2.22 | 579 | 12d ago | |
| PTB (test) | Perplexity8.159 | 526 | 2d ago | ||
| C4 (val) | PPL5.709 | 514 | 4d ago | ||
| Penn Treebank (test) | GL-LWGC-AWD-MOS-LSTM + dynamic evaluation | Perplexity46.34 | 411 | 1mo ago | |
| WikiText2 (val) | TCN-SEQ-J | Perplexity (PPL)3.03 | 387 | 11d ago | |
| WikiText2 v1 (test) | Perplexity1.7 | 383 | 1mo ago | ||
| C4 (test) | Perplexity4.97 | 342 | 2d ago | ||
| Wiki | Wanda | Perplexity (PPL)2 | 281 | 5d ago | |
| LAMBADA | PaLM-2 L | Accuracy86.9 | 268 | 8d ago | |
| WikiText-103 (val) | PPL1.01 | 214 | 12d ago | ||
| WT2, PTB, and C4 Macro Average (test) | AWQ | Perplexity13.1 | 192 | 26d ago | |
| WikiText-103 | ESPACE | PPL4.59 | 189 | 4d ago | |
| Penn Treebank (val) | GL-LWGC-AWD-MOS-LSTM + dynamic evaluation | Perplexity46.64 | 178 | 1mo ago | |
| Wikitext2 | Perplexity2.58 | 162 | 8d ago | ||
| PG-19 | CREAM | Perplexity5 | 160 | 4d ago | |
| FineWeb (val) | UMTAM | Validation Loss2.03 | 159 | 4d ago | |
| LAMBADA | Perplexity3.1 | 150 | 24d ago | ||
| Wiki2 | PPL4.01 | 149 | 4d ago |