| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Detection of paraphrased text | GPT-4o-mini Paraphrased | ROC AUC (FPR=1%): 0.4231 | 8 |
| Denial-of-Service Attack | GPT-4o-mini 2024-07-18 (test) | Response Length: 16,384 | 6 |
| Policy Corruption Evaluation | GPT-4o mini | Compliance Score: 3.53 | 5 |
| Targeted Adversarial Attack | GPT-4o | ASR: 860 | 4 |