| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Safety evaluation | Sorry-Bench | Safety Score: 99.09 | 90 |
| Safety Alignment | SORRY-Bench | ASR: 10.22 | 40 |
| Safety Evaluation | Sorry-Bench base | Safety Score: 92.73 | 27 |
| Harmful Request Defense | SORRY-Bench | ASR: 13 | 24 |
| Refusal Control | SORRY-Bench | Refusal Rate: 70.45 | 7 |