| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Modeling | Language Modeling | Perplexity8.005 | 26 | |
| Language Modeling | Language Modeling Evaluation | Perplexity (PPL)1.71 | 16 | |
| Language Modeling | Language Modeling Average | PPL5.67 | 12 | |
| Membership Inference Attack | Language Modeling PII-annotated (train) | TPR @ 0.1% FPR21.5 | 9 | |
| Language Modeling | Language Modeling (LM) | CE (128-255 tokens)2.69 | 7 | |
| Language Modeling | Language Modeling (test) | PPL6.2 | 7 |