Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Policy Corruption Evaluation on DeepSeek V3
Loading...
4.12
Compliance
HPM
1.3016
2.0333
2.765
3.4967
Dec 20, 2025
Compliance
Trustfulness
Recklessness
Harm Violation
Value Drift
Self Doubt
Confusion
Updated 4d ago
Evaluation Results
Method
Method
Links
Compliance
Trustfulness
Recklessness
Harm Violation
Value Drift
Self Doubt
Confusion
HPM
Victim Model=DeepSeek-V3
2025.12
4.12
3.9
3.61
4.2
3.33
3.45
3.7
PAP
Victim Model=DeepSeek-V3
2025.12
2.58
2.45
2.2
2.55
1.7
2.05
1.92
CoA
Victim Model=DeepSeek-V3
2025.12
2.22
2.01
1.73
2.3
1.22
1.25
1.41
PAIR
Victim Model=DeepSeek-V3
2025.12
1.7
1.42
1.21
1.53
1.01
0.72
0.9
AutoDAN
Victim Model=DeepSeek-V3
2025.12
1.41
1.25
1.02
1.2
0.75
0.51
0.63
Feedback
Search any
task
Search any
task