| Dataset Name | SOTA Method | Metric | Value | Trend | Last Updated |
|---|---|---|---|---|---|
| OpenWebText | - | Gen. PPL | 1.21 | 219 | 19d ago |
| EMNLP 2017 WMT News | DM-LSTM | Perplexity | 36.11 | 64 | 3mo ago |
| OpenWebText (OWT) (test) | TOKENDRIFT | Generation Perplexity | 27.35 | 30 | 14d ago |
| ROCStories | TEncDM Enc | MAUVE | 86.8 | 27 | 16d ago |
| LM1B | MDLM | Entropy | 4.29 | 24 | 21d ago |
| OpenWebText (test) | D3PM Absorb | LLAMA2 Score | 692.3 | 21 | 3mo ago |
| Wikipedia | TEncDM Enc | MAUVE Score | 90.1 | 18 | 2mo ago |
| 1024-token unconditional generation uniform source 1.3B parameters (test) | DFM | Entropy (Ent) | 8.1 | 13 | 23d ago |
| Unconditional Text Generation | CoM-DAD (large) | BLEU | 47.46 | 11 | 3mo ago |
| WMT News EMNLP2017 | PRPN | LM Score | 5.24 | 8 | 3mo ago |
| EMNLP WMT News 2017 | DM-GPT-2 | Human Score | 0.512 | 8 | 3mo ago |
| OpenWebText (OWT) length 1024 | MDLM | Training Duration (days) | 2.9 | 6 | 23d ago |
| OpenWebText (held-out segments) | *Data | MAUVE | 1 | 6 | 2mo ago |
| OpenWebText | *Data | MAUVE | 1 | 6 | 2mo ago |
| 32 Generated Samples (inference) | AATU | Average Perplexity | 31.82 | 6 | 3mo ago |
| OpenWebText | BD3-LM + Gumbel Distillation | Clarity | 3.41 | 4 | 2mo ago |
| text8 unconditional | - | - | - | 0 | 3mo ago |