| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| PHEME New Attacks: ExplainDrive (test) | LLM-SGA/ARHOCD | Accuracy82.91 | 15 | 4d ago | |
| PHEME Known Attacks: DeepWordBug, TFAdjusted, TREPAT (test) | LLM-SGA/ARHOCD | Accuracy85.59 | 10 | 4d ago | |
| Standard Harmful Content Datasets Evasion Attack | GAVEL | Phishing96 | 3 | 4d ago | |
| Standard Harmful Content Datasets (Goal Hijacking Attack) | GAVEL | Phishing96 | 2 | 4d ago | |
| Standard Harmful Content Datasets Misdirection Attack | GAVEL | Phishing97 | 2 | 4d ago |