| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Evaluation | Safety | Score92.1 | 27 | |
| Alignment and Safety Evaluation | Safety | Avg@k73 | 15 | |
| Safety Assessment | Safety Avg. | MAE2.6912 | 14 | |
| Safety | Safety OOD | Accuracy93.14 | 13 | |
| Safety | Safety ID | Accuracy99.81 | 13 | |
| Safety Evaluation | Safety Tweet Eval, Hatecheck, Ethos (test) | Accuracy83.3 | 12 | |
| Safety Alignment | Safety BeaverTails, HEx-PHI (test) | BeaverTails Score95.67 | 10 | |
| Safety Evaluation | Safety Overall | Reasoning Accuracy (Avg)32.9 | 4 |