Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Forget to Know, Remember to Use: Context-Aware Unlearning for Large Language Models

About

Large language models may encode sensitive information or outdated knowledge that needs to be removed, to ensure responsible and compliant model responses. Unlearning has emerged as an efficient alternative to full retraining, aiming to remove specific knowledge while preserving overall model utility. Existing evaluations of unlearning methods focus on (1) the extent of forgetting of the target knowledge (forget set) and (2) maintaining performance on the retain set (i.e., utility). However, these evaluations overlook an important usability aspect: users may still want the model to leverage the removed information if it is re-introduced in the prompt. In a systematic evaluation of six state-of-the-art unlearning methods, we find that they consistently impair such contextual utility. To address this, we augment unlearning objectives with a plug-in term that preserves the model's ability to use forgotten knowledge when it is present in context. Extensive experiments demonstrate that our approach restores contextual utility to near original levels while still maintaining effective forgetting and retain-set utility.

Yuefeng Peng, Parnian Afshar, Megan Ganji, Thomas Butler, Amir Houmansadr, Mingxian Wang, Dezhi Hong• 2025

Related benchmarks

TaskDatasetResultRank
Contextual Question Answering5% (forget set)
ROUGE-L91
12
Direct Question Answering5% (forget set)
ROUGE-L Score36
12
Model Utility PreservationModel Utility (retain set)
Utility61
12
Contextual Question AnsweringGemma-2B-IT 5% forget set
ROUGE-L92.4
8
Direct Question AnsweringGemma-2B-IT 5% forget set
ROUGE-L47.1
8
Model UtilityGemma-2B-IT
Utility57.8
8
Contextual Question AnsweringPISTOL (A_B)
ROUGE-L97.6
6
Contextual Question AnsweringPISTOL (A_C)
ROUGE-L98
6
Direct Question AnsweringPISTOL (A_B)
ROUGE-L79.1
6
Direct Question AnsweringPISTOL (A_C)
ROUGE-L77.3
6
Showing 10 of 10 rows

Other info

Follow for update