Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DeepSeek

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak attackDeepSeek-7b five finetuned variants
Average ASR3.8
16
Jailbreak Attackdeepseek-7b v1 (pretrained)
ASR (%)100
13
Constrained LLM DecodingDeepSeek-V2-Lite-Chat 15.7B
Inference Time (ms)49.91
10
JailbreakingDeepSeek V3.2
Attack Success Rate78.5
9
Detection of paraphrased textDeepSeek Paraphrased V3
ROC AUC (1% FPR)0.4178
8
Policy Corruption EvaluationDeepSeek V3
Compliance4.12
5
Showing 6 of 6 rows