| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Modeling | OWT | Gen. PPL47.13 | 61 | |
| Text Generation | OWT | GPT2 Perplexity5.33 | 41 | |
| Conditional Generation | OWT | Perplexity (PPL)19.99 | 24 | |
| Language Modeling | OWT L=1024 (test) | NELBO PPL20.96 | 11 | |
| Language Modeling | OWT (val) | PPL17.5 | 7 | |
| Generation Latency | OWT L=8192 | Generation Latency (s)54 | 5 | |
| Generation Latency | OWT L=2048 | Sampling Latency (s)13.3 | 5 | |
| Language Modeling | OWT L=10240 (test) | Gen. PPL23.4 | 2 |