| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Jailbreak Attack | HarmfulQA | JADES56 | 33 | |
| Harmlessness evaluation | HarmfulQA | Helpfulness Score69.4 | 33 | |
| Safety Evaluation | HARMFULQA various domains | Safety Score (Chinese)19.17 | 8 | |
| Red-Teaming (Attack Success Rate) | HARMFULQA | ASR0.702 | 7 | |
| Jailbreak Attack Evaluation | HarmfulQA | ASR16 | 6 | |
| Language Modeling | HarmfulQA | PPL83.41 | 1 |