Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Machine Unlearning on WMDP Cyber (ES, MCQ)
Loading...
46.7
ES
Base Model
18.62
25.91
33.2
40.49
Apr 5, 2026
ES
MCQ
Updated 12d ago
Evaluation Results
Method
Method
Links
ES
MCQ
Base Model
2026.04
46.7
44.8
Muon
Runtime=11.8
2026.04
30.5
28.6
POME
Runtime=9.9
2026.04
29.6
27.5
AdamW
Runtime=8.8
2026.04
26.3
29.4
BLUR
Runtime=9.4
2026.04
24.8
27.3
SIFT
Runtime=20.2
2026.04
19.7
26.4
Feedback
Search any
task
Search any
task