| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| AdvBench | AMIS | ASR100 | 132 | 1mo ago | |
| AdvBench selected models | Few-shot threat model | ASR@10100 | 90 | 9d ago | |
| HarmBench | UJEM-KL | Attack Success Rate (ASR)82.3 | 68 | 21d ago | |
| AdvBench | JULI | BERT Score4.84 | 55 | 2mo ago | |
| JailbreakBench | Attack Success Rate (ASR)2 | 53 | 28d ago | ||
| HARMBENCH 159 standard behaviors (test) | ASR0 | 51 | 3mo ago | ||
| MHSC | TCBS-Attack | ASR-431 | 44 | 2mo ago | |
| Q16 | TCBS-Attack | ASR-452.5 | 44 | 2mo ago | |
| AdvBench Sub | BERT Score4.73 | 40 | 2mo ago | ||
| JBB Behaviors | AMIS | ASR100 | 35 | 2mo ago | |
| AdvBench (test) | CHaRS-PCT | Average ASR99.04 | 33 | 3mo ago | |
| MaliciousInstruct (test) | JULI | BERT Score4.63 | 32 | 2mo ago | |
| EHRAgent TREQS | SR71.78 | 30 | 1mo ago | ||
| EHRAgent eICU | Success Rate (SR)57.93 | 30 | 1mo ago | ||
| EHRAgent MIMIC-III | SR56.55 | 30 | 1mo ago | ||
| StrongReject (test) | HMNS | ASR (GPT-4o)96 | 27 | 1mo ago | |
| JBB-Behaviors (test) | HMNS | ASR (GPT-4o)99 | 27 | 1mo ago | |
| HarmBench (test) | HMNS | ASR (GPT-4o)97 | 27 | 1mo ago | |
| AdvBench (test) | HMNS | ASR (GPT-4o)99 | 27 | 1mo ago | |
| AdvBench 20% evaluation | A-LQR+ | ASR97.12 | 25 | 1mo ago | |
| EHRAgent ALL | JailAgent | Weighted Average ASR70.112 | 24 | 1mo ago | |
| ConceptRisk | VII | Attack Success Rate68 | 24 | 3mo ago | |
| COCO-I2VSafetyBench | ASR17 | 24 | 3mo ago | ||
| Unsafe Prompts | DACA | Bypass Success Rate (Text)98.5 | 22 | 2mo ago | |
| JailbreakBench | VAE-JB | ASR (Detoxify)0 | 20 | 1mo ago |