| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreak Attack | Claude 3.5 | ASR 0 | 19 |
| Jailbreak Attack | Claude Sonnet API 3.5 | ASR 80.5 | 16 |
| Black-box Adversarial Attack | Claude thinking 4.0 | KMR (a) 0.02 | 9 |
| Jailbreaking | Claude 4.5 | ASR 97 | 9 |
| AI-generated text detection | Claude-generated (test) | F1 Score 92.2 | 5 |