| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreak Attack | MaliciousInstruct | ASR 100 | 35 |
| Adversarial and Jailbreaking Attack Detection | MaliciousInstruct | AUROC 0.8825 | 20 |
| Visual Jailbreaking Attack | MaliciousInstruct | ASR 92 | 16 |
| Jailbreak Attack | MaliciousInstruct (test) | ASR (Refusal) 95 | 10 |
| Jailbreak Attack | MaliciousInstruct 41 (test) | ASR 0.935 | 6 |