Share your thoughts, 1 month free Claude Pro on usSee more

Unlearning on MUSE-Books Harry Potter (forget set 500 samples)

39.99R-Forget-500

Base Model (Llama3.2-3B)

Updated 4mo ago

Evaluation Results

Method	Links
Base Model (Llama3.2-3B) 2026.01		39.99
GA + KL (Dr) 2026.01		38.29
Refusal-Training 2026.01		37.75
GA (DQA_f) + KL (Dr) 2026.01		36.87
NPO (DQA_f) 2026.01		34.28
NPO + KL (Dr) 2026.01		33.62
NPO 2026.01		26.83
NPO (DQA_f) + KL (Dr) 2026.01		25.6
SimNPO 2026.01		21.41
DUET 2026.01		5.98
FLAT 2026.01		0.64
GA 2026.01		0
GA (DQA_f) 2026.01		0