| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Utility Evaluation | SLIMORCA (test) | Score 68.85 | 24 |
| Win rate evaluation | SLIMORCA (test) | Win Rate 88.12 | 8 |
| Language Modeling | SlimOrca (test) | Test PPL 3.81 | 3 |
| Safety Evaluation | SLIMORCA OOD Safety Scenario (test) | Harmfulness Score 2.4 | 2 |