| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Language Modeling | RedPajama LLaMA pretraining corpus (evaluation slice) | Perplexity (bits/byte): 0.62 | 15 |
| Safety Evaluation | RedPajama Safety Evals (test) | Safety Score (Avg): 93.4 | 7 |
| Generation Quality | RedPajama Generation Quality Prefixes (test) | Standard Prefix Count: 32.4 | 4 |