| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WikiText2 | Perplexity2.73 | 3,785 | 14h ago | ||
| WikiText-2 (test) | PPL1.2 | 2,333 | 15h ago | ||
| WikiText-2 | Calibration-free SLEB | Perplexity (PPL)1.08 | 2,320 | 17h ago | |
| C4 | Wanda | Perplexity1 | 1,688 | 12d ago | |
| C4 | Llama2-7B | Perplexity4.77 | 1,565 | 18d ago | |
| PTB | Perplexity4.345 | 1,234 | 14d ago | ||
| WikiText | Llama 3.1-70B | PPL0.2838 | 740 | 20d ago | |
| C4 (val) | PPL1.7 | 737 | 19h ago | ||
| WikiText-103 (test) | RETRO | Perplexity2.22 | 703 | 1d ago | |
| PTB (test) | Perplexity8.159 | 543 | 6d ago | ||
| C4 (test) | Perplexity4.97 | 464 | 18h ago | ||
| WikiText2 (val) | TCN-SEQ-J | Perplexity (PPL)3.03 | 423 | 1d ago | |
| Penn Treebank (test) | GL-LWGC-AWD-MOS-LSTM + dynamic evaluation | Perplexity46.34 | 420 | 5d ago | |
| LAMBADA | PaLM-2 L | Accuracy86.9 | 412 | 4d ago | |
| WikiText2 v1 (test) | Perplexity1.7 | 383 | 2mo ago | ||
| Wiki2 | PPL4.01 | 326 | 14d ago | ||
| Wiki | Wanda | Perplexity (PPL)2 | 298 | 1mo ago | |
| Wikitext2 | Perplexity2.58 | 277 | 14d ago | ||
| WikiText-103 (val) | PPL1.01 | 261 | 1d ago | ||
| WikiText | Word Perplexity3.12 | 234 | 17h ago | ||
| FineWeb (val) | UMTAM | Validation Loss2.03 | 217 | 11d ago | |
| WikiText-103 | ESPACE | PPL4.59 | 216 | 26d ago | |
| PG-19 | CREAM | Perplexity5 | 206 | 19h ago | |
| LAMBADA | Perplexity3.1 | 198 | 20d ago | ||
| WT2, PTB, and C4 Macro Average (test) | AWQ | Perplexity13.1 | 192 | 2mo ago |