| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| AdvBench | AMIS | ASR | 100 | 114 | 15d ago |
| AdvBench | JULI | BERT Score | 4.84 | 55 | 1mo ago |
| HARMBENCH 159 standard behaviors (test) | | ASR | 0 | 51 | 1mo ago |
| MHSC | TCBS-Attack | ASR-4 | 31 | 44 | 1mo ago |
| Q16 | TCBS-Attack | ASR-4 | 52.5 | 44 | 1mo ago |
| AdvBench Sub | | BERT Score | 4.73 | 40 | 1mo ago |
| JBB Behaviors | AMIS | ASR | 100 | 35 | 15d ago |
| AdvBench (test) | CHaRS-PCT | Average ASR | 99.04 | 33 | 1mo ago |
| MaliciousInstruct (test) | JULI | BERT Score | 4.63 | 32 | 1mo ago |
| EHRAgent TREQS | | SR | 71.78 | 30 | 10d ago |
| EHRAgent eICU | | Success Rate (SR) | 57.93 | 30 | 10d ago |
| EHRAgent MIMIC-III | | SR | 56.55 | 30 | 10d ago |
| StrongReject (test) | HMNS | ASR (GPT-4o) | 96 | 27 | 4d ago |
| JBB-Behaviors (test) | HMNS | ASR (GPT-4o) | 99 | 27 | 4d ago |
| HarmBench (test) | HMNS | ASR (GPT-4o) | 97 | 27 | 4d ago |
| AdvBench (test) | HMNS | ASR (GPT-4o) | 99 | 27 | 4d ago |
| EHRAgent ALL | JailAgent | Weighted Average ASR | 70.112 | 24 | 10d ago |
| ConceptRisk | VII | Attack Success Rate | 68 | 24 | 1mo ago |
| COCO-I2VSafetyBench | | ASR | 17 | 24 | 1mo ago |
| Unsafe Prompts | DACA | Bypass Success Rate (Text) | 98.5 | 22 | 1mo ago |
| JailbreakBench | | Attack Success Rate (ASR) | 2 | 21 | 1mo ago |
| JailbreakBench | VAE-JB | ASR (Detoxify) | 0 | 20 | 4d ago |
| StrongREJECT | VAE-JB | ASR (Detoxify) | 0 | 20 | 4d ago |
| AdvBench | VAE-JB | ASR (Detoxify) | 0 | 20 | 4d ago |
| GPT-4o | ICON | ASR | 0.99 | 19 | 24d ago |