| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreaking | GPT-4o | ASR 0.99 | 9 |
| Jailbreaking | GPT 5.1 | ASR 96.5 | 9 |
| Adversarial Attack | GPT-4o | CLIP Similarity (RN-50) 0.259 | 9 |
| Detection of paraphrased text | GPT Paraphrased 4.1 | ROC AUC (1% FPR) 0.3977 | 8 |
| Text-to-Video Generation | GPT-G | Semantic Objective 76.8 | 4 |
| Machine-generated text detection | GPT-3.5 (test) | Accuracy 99.14 | 4 |
| Text Generation | MiniGPT-4 | BLEU-1 48.1 | 3 |