| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Average of six attacks | GradSafe | Avg Success Rate | 0 | 30 | 4d ago |
| Zulu | IMAG | Accuracy | 92 | 30 | 4d ago |
| Base64 | IMAG | Accuracy | 100 | 30 | 4d ago |
| DrAttack | GradSafe | Accuracy | 99 | 30 | 4d ago |
| PAIR | IMAG | Accuracy | 98 | 30 | 4d ago |
| AutoDAN | IMAG | Accuracy | 99 | 30 | 4d ago |
| GCG | IMAG | Accuracy | 99 | 30 | 4d ago |
| GoalFrameBench | FrameShield-Crit | Accuracy | 94 | 24 | 4d ago |
| GoalFrameBench (seed prompts) | FrameShield-Last | Accuracy | 97 | 16 | 4d ago |
| ChatGPT Jailbreak Prompts | | Recall | 100 | 15 | 4d ago |
| Wildjailbreak | Apriel Guard | F1 Score | 96 | 15 | 4d ago |
| OKT | | Correlation Score | 1 | 13 | 4d ago |
| SB | gpt-4o-mini | COR | 98.33 | 13 | 4d ago |
| HB | gpt-4o-mini | Correctness Rate (COR) | 100 | 13 | 4d ago |
| ADVB | gpt-4.1-mini | Accuracy | 100 | 13 | 4d ago |
| AEG2 | gpt-5-mini | Accuracy | 79.94 | 13 | 4d ago |
| L3J | GradSafe | Accuracy | 98.3 | 13 | 4d ago |
| WGT | AISA | Accuracy | 90.17 | 13 | 4d ago |
| XST | AISA | Accuracy | 95.11 | 13 | 4d ago |
| ALL-4 | gpt-5-mini | Accuracy | 92.6 | 13 | 4d ago |
| JBC | | Accuracy | 99.2 | 13 | 4d ago |
| WJB | gpt-5-mini | ACC | 93.26 | 13 | 4d ago |
| FQ-PH | | ACC | 86.54 | 13 | 4d ago |
| EJ-OO | | Accuracy | 99.17 | 13 | 4d ago |
| LLaVA Vicuna-7B v1.6 | KCD | Accuracy | 92 | 13 | 4d ago |