| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multiple-Choice Questioning | SafetyBench English (test) | Accuracy60.13 | 35 | |
| Safety Evaluation | SafetyBench | Accuracy89 | 28 | |
| Safety Evaluation | SafetyBench | Safety69.4 | 26 | |
| Safety Evaluation | SafetyBench en | Avg Score81.2 | 25 | |
| Safety Evaluation | SafetyBench zh | Avg Score83.2 | 21 | |
| Jailbreak Attack Evaluation | SafetyBench MCV | ASR (1-Clip)79.79 | 16 | |
| Safety Evaluation | SafetyBench (test) | Accuracy81.321 | 9 | |
| Jailbreak Attack | SafetyBench LLaVA-2 Integrated from AdvBench (test) | Illegal Activity Success Rate83.73 | 4 | |
| Jailbreak Attack | SafetyBench MiniGPT-4 Integrated from AdvBench (test) | IA (Illegal Activity)0.7024 | 4 |