| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreak Attack | Harmful Query Evaluation Set (Claude Haiku 4.5, N=750) | Toxicity: 1.01 | 10 |
| Jailbreak Attack | Harmful Query Evaluation Set (Gemini-3.1-Flash-Lite, N=750) | Toxicity: 2.75 | 10 |
| Jailbreak Attack | Harmful Query Evaluation Set (Gemini-2.5-Flash, N=750) | Toxicity: 2.18 | 10 |
| Jailbreak Attack | Harmful Query Evaluation Set (GPT-5.4-mini, N=750) | Toxicity Score: 1.01 | 10 |
| Jailbreak Attack | Harmful Query Evaluation Set (GPT-5.4-nano, N=750) | Toxicity Score: 1.05 | 10 |