Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ICCU: In-Context Continual Unlearning via Pattern-Induced Refusal Rules

About

Machine unlearning aims to remove the influence of specific data from trained language models. In real-world deployments, unlearning requests often arrive sequentially, which challenges existing fine-tuning-based methods: fine-tuning each request is costly, accumulates utility loss, and may cause cross-request interference. To address these issues, we propose ICCU (In-Context Continual Unlearning), an in-context continual unlearning framework that induces readable refusal rules from unlearning datasets and applies them at inference time either as a filter or via the system prompt, without modifying model parameters. Because rules are accumulated as an order-independent union, ICCU is compositional and free of cross-request interference, and the original forget-set data can be discarded after rule induction. Extensive experiments show that ICCU effectively suppresses target knowledge while preserving utility, scales across sequential requests, and remains robust to paraphrased and cross-lingual queries.

Ruihao Pan, Suhang Wang• 2026

Related benchmarks

TaskDatasetResultRank
Machine UnlearningTOFU (5%)--
59
Machine UnlearningTOFU (1%)--
36
End-to-end unlearningTOFU (10%)
ROUGE-L Score (Forget)7.9
14
Language UnderstandingMMLU
MMLU Accuracy73.3
12
UnlearningWMDP
Bio Accuracy28.9
12
Machine UnlearningTOFU (forget01)--
9
Machine UnlearningTOFU (Forget05)--
8
Machine UnlearningTOFU (Retain99)
Refusal Rate7.5
6
Machine UnlearningTOFU Retain95
Refusal Rate3.9
6
Machine UnlearningTOFU Retain90
Refusal Rate0.6
6
Showing 10 of 23 rows

Other info

Follow for update