
Machine Unlearning on WMDP-Cyber (Pre/Post Accuracy)

Accuracy (Pre-Unlearning): 41.5

Original model

[Chart: accuracy by date; the only date label recovered is Apr 15, 2026]
Updated 3d ago

Evaluation Results

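Method           Accuracy (Pre-Unlearning)  Accuracy (header not recovered)  Date
Original model   41.5                       -                                2026.04
(not recovered)  40.8                       41.4                             2026.04
(not recovered)  40.2                       39.8                             2026.04
(not recovered)  39.6                       41                               2026.04
(not recovered)  37.6                       39.5                             2026.04
(not recovered)  27.4                       37.4                             2026.04
(not recovered)  24.6                       26.5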