Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DeepSeek

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak AttackDeepSeek
NR Score0
20
Jailbreak attackDeepSeek-7b five finetuned variants
Average ASR3.8
16
Jailbreak Attackdeepseek-7b v1 (pretrained)
ASR (%)100
13
Constrained LLM DecodingDeepSeek-V2-Lite-Chat 15.7B
Inference Time (ms)49.91
10
JailbreakingDeepSeek V3.2
Attack Success Rate78.5
9
Detection of paraphrased textDeepSeek Paraphrased V3
ROC AUC (1% FPR)0.4178
8
Policy Corruption EvaluationDeepSeek V3
Compliance4.12
5
CPU Inference Performance EvaluationDeepSeek Lite V2
Memory Usage (GB)8.8
3
Weight Reconstruction FidelityDeepSeek-V3 Weights
Weight ΔW L2 Distance0
3
Showing 9 of 9 rows