| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MMLU | INFA-GUARD | ASR@315 | 31 | 3d ago | |
| GSM8k | G-Safeguard | ASR@36 | 28 | 3d ago | |
| CSQA | G-Safeguard | ASR@318.33 | 28 | 3d ago | |
| MMLU random topology | Inspector | ASR (k=1)15.5 | 16 | 4d ago | |
| URL-based PI (200-sample dataset) | ASR33.5 | 12 | 4d ago | ||
| Spam Email | Separator Injection | ASR (None Defense)0.3 | 10 | 4d ago | |
| Negative Review | Separator Injection | ASR (None Defense)0 | 10 | 4d ago | |
| Toxic Comment | Topic Attack | ASR (None)100 | 10 | 4d ago | |
| GCG Clean | CAHL | ASR37.02 | 4 | 4d ago | |
| Representative guardrail dataset | ChainPoll | F1 Score97 | 3 | 4d ago |