| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| S-Eval Aattack | Attack Success Rate (ASR)92 | 72 | 4d ago | ||
| TRIDENT CORE | TRIDENT-CORE | HPR7 | 38 | 4d ago | |
| HarmBench (400 random samples) | Llama-2-7B-Chat (Original) | ASR0 | 18 | 4d ago | |
| StealthGraph SG-Implicit | ASR91 | 12 | 4d ago | ||
| AdvBench | CodeAttack | GPT-3.5 Success Rate94 | 8 | 4d ago | |
| TRIDENT-EDGE | TRIDENT-EDGE | HPR5 | 7 | 4d ago | |
| StealthGraph SG-Origin | ASR39.5 | 6 | 4d ago | ||
| HarmfulQA | ASR16 | 6 | 4d ago | ||
| Do-Not-Answer | ASR2.5 | 6 | 4d ago | ||
| FigStep Average | SafeThink | Average ASR0.053 | 5 | 4d ago |