| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| HARMBENCH 159 standard behaviors (test) | | ASR | 0 | 51 | 4d ago |
| AdvBench | PiF | ASR | 99.2 | 44 | 4d ago |
| ConceptRisk | VII | Attack Success Rate | 68 | 24 | 4d ago |
| COCO-I2VSafetyBench | | ASR | 17 | 24 | 4d ago |
| HarmBench 51 (test) | WILDTEAMING | ASR@5 (Standard) | 98.1 | 19 | 4d ago |
| AdvBench (test) | MTSA-R3 | ASR (GPT-3.5) | 72 | 12 | 4d ago |
| AdvBench | ADV-LLM | ASR@1 (No Refusal) | 0 | 11 | 4d ago |
| Llama 4 | ICON | ASR | 0.965 | 9 | 4d ago |
| Llama 3.1 | ICON | ASR | 0.97 | 9 | 4d ago |
| DeepSeek V3.2 | PE-CoA | Attack Success Rate | 78.5 | 9 | 4d ago |
| Qwen-Max | PSA | ASR | 100 | 9 | 4d ago |
| Gemini Pro 3 | ICON | ASR | 92.5 | 9 | 4d ago |
| GPT-4o | ICON | ASR | 0.99 | 9 | 4d ago |
| GPT 5.1 | ICON | ASR | 96.5 | 9 | 4d ago |
| Claude 4.5 | ICON | ASR | 97 | 9 | 4d ago |
| HarmBench Transfer attack | Vicuna-13B | GCG Success Rate | 65.6 | 8 | 4d ago |
| English Visual Attack Structures Multi-Image | | ASR | 0.1693 | 6 | 4d ago |
| English Visual Attack Structures Single Image | | ASR | 0.6 | 6 | 4d ago |
| English Visual Attack Structures Text | | Attack Success Rate (ASR) | 10.4 | 6 | 4d ago |
| JailbreakBench | PAIR with RT mutator LLM | Jailbroken Behaviors (k) | 1 | 5 | 4d ago |
| SafeBench evaluated on OpenAI-o1 | | FS | 34.8 | 1 | 4d ago |