WMDP

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Machine Unlearning | WMDP | Bio Accuracy 24.8 | 74 |
| Harmful Knowledge Evaluation | WMDP evil | WMDP-evil Score 65.37 | 60 |
| Knowledge Unlearning | WMDP bio | Accuracy 71.2 | 51 |
| Knowledge Unlearning | WMDP cyber | Accuracy 47.21 | 47 |
| Question Answering | WMDP Biology | Default Score 64.5 | 38 |
| Question Answering | WMDP Cyber QA | Default Accuracy 44.3 | 38 |
| Knowledge Retention | WMDP retain | Retain 48.3 | 36 |
| Knowledge Recovery | WMDP-Bio 100-sample subset | ASR 0.93 | 36 |
| Machine Unlearning | WMDP Cyber (test) | MMLU 61.15 | 29 |
| Machine Unlearning | WMDP-cyber 1.0 (test) | BF16 Score 53.7 | 28 |
| Machine Unlearning | WMDP-chem 1.0 (test) | BF16 0.56 | 28 |
| Machine Unlearning | WMDP-bio 1.0 (test) | BF16 Accuracy 80.3 | 28 |
| Knowledge Unlearning | WMDP | Performance (Bio) 75.9 | 26 |
| Hazard Knowledge Evaluation | WMDP | Accuracy 68.98 | 26 |
| Unlearning | WMDP retain | Retain 55.6 | 22 |
| Unlearning | WMDP (forget split) | BF16 Precision 55.4 | 22 |
| Fluency Assessment | WMDP | Mean Fluency 3.46 | 22 |
| Machine Unlearning | WMDP | Acc (Bio) 74.16 | 21 |
| Dangerous Knowledge Unlearning | WMDP | S-unlearning Score 43 | 16 |
| Knowledge Retention | WMDP cyber (retain) | Rt 54.1 | 16 |
| Machine Unlearning | WMDP-cyber forget-set | BF16 Performance 53.7 | 16 |
| Knowledge Retention | WMDP-chem (retain) | Rt (Knowledge Retention) 56 | 16 |
| Machine Unlearning | WMDP chem forget-set | BF16 Score 56 | 16 |
| Knowledge Retention | WMDP bio (retain) | Rt 80.3 | 16 |
| Structural Erasure | WMDP-cyber | CAD 0 | 16 |

Showing 25 of 70 rows