| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Jailbreak attack | DeepSeek-7b five finetuned variants | Average ASR3.8 | 16 | |
| Jailbreak Attack | deepseek-7b v1 (pretrained) | ASR (%)100 | 13 | |
| Constrained LLM Decoding | DeepSeek-V2-Lite-Chat 15.7B | Inference Time (ms)49.91 | 10 | |
| Jailbreaking | DeepSeek V3.2 | Attack Success Rate78.5 | 9 | |
| Detection of paraphrased text | DeepSeek Paraphrased V3 | ROC AUC (1% FPR)0.4178 | 8 | |
| Policy Corruption Evaluation | DeepSeek V3 | Compliance4.12 | 5 |