WMDP

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Knowledge Unlearning | WMDP bio | Accuracy 71.2 | 42 |
| Question Answering | WMDP Biology | Default Score 64.5 | 38 |
| Question Answering | WMDP Cyber QA | Default Accuracy 44.3 | 38 |
| Knowledge Unlearning | WMDP cyber | Accuracy 47.21 | 38 |
| Knowledge Recovery | WMDP-Bio 100-sample subset | ASR 0.93 | 36 |
| Fluency Assessment | WMDP | Mean Fluency 3.46 | 22 |
| Machine Unlearning | WMDP Cyber (test) | MMLU 60.65 | 21 |
| Hazard Knowledge Evaluation | WMDP | Accuracy 68.98 | 18 |
| Machine Unlearning | WMDP | Unlearn Score 76.1 | 16 |
| Unlearning Detection | WMDP | Accuracy 100 | 16 |
| Machine Unlearning | WMDP | Bio Score 64.7 | 15 |
| Machine Unlearning | WMDP | Acc (Bio) 74.16 | 12 |
| Knowledge Unlearning | WMDP Bio (test) | Accuracy Forget 64.81 | 11 |
| Harmful Knowledge Removal | WMDP Bio | Acc_r 78.5 | 10 |
| Tracing | WMDP (test) | TSR 100 | 10 |
| Machine Unlearning | WMDP average of biology and cyber | Accuracy 0.557 | 10 |
| Machine Unlearning | WMDP Bio (test) | Bio Score 63.7 | 10 |
| Unlearning | WMDP bio | WMDPbio Score 0.628 | 9 |
| Unlearning | WMDP | WMDP Score 0.489 | 9 |
| Machine Unlearning | WMDP bio | Multi-turn ASR Error Rate 3.1 | 9 |
| Machine Unlearning | WMDP bio | Accuracy 48.9 | 9 |
| Machine Unlearning | WMDP | Accuracy 39.7 | 9 |
| Machine Unlearning | WMDP Cyber | Rel 7.19 | 9 |
| Machine Unlearning | WMDP Bio | Rel Score 6.72 | 9 |
| Question Answering | WMDP multiple-choice QA | Bio Accuracy 65.1 | 9 |

Showing 25 of 37 rows