Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models

About

Continual unlearning poses the challenge of enabling large vision-language models to selectively refuse specific image-instruction pairs in response to sequential deletion requests, while preserving general utility. However, sequential unlearning updates distort shared representations, creating spurious associations between vision-language pairs and refusal behaviors that hinder precise identification of refusal targets, resulting in inappropriate refusals. To address this challenge, we propose a novel continual unlearning framework that grounds refusal behavior in fine-grained descriptions of visual and textual concepts decomposed from deletion targets. We first identify which visual-linguistic concept combinations characterize each forget category through a concept modulator, then determine how to generate appropriate refusal responses via a mixture of refusal experts, termed refusers, each specialized for concept-aligned refusal generation. To generate concept-specific refusal responses across sequential tasks, we introduce a multimodal, concept-driven routing scheme that reuses refusers for tasks sharing similar concepts and adapts underutilized ones for novel concepts. Extensive experiments on vision-language benchmarks demonstrate that the proposed framework outperforms existing methods by generating concept-grounded refusal responses and preserving the general utility across unlearning sequences.
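The routing idea in the abstract (reuse a refuser when a new task shares concepts with an earlier one, otherwise adapt an underutilized refuser) can be illustrated with a small sketch. This is not the authors' implementation: the names `Refuser` and `ConceptRouter`, the cosine-similarity matching, the fixed threshold, and the "overwrite the least-used refuser's key" adaptation rule are all assumptions made for illustration. The `concept` vector stands in for whatever fused visual and textual concept embedding the concept modulator would produce.

```python
import math
from dataclasses import dataclass


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class Refuser:
    """Hypothetical refusal expert: a concept key plus a usage counter."""
    key: list          # concept embedding this refuser currently specializes in
    usage: int = 0     # how many times this refuser has been selected


class ConceptRouter:
    """Hypothetical router: reuse a similar refuser or adapt the least-used one."""

    def __init__(self, num_refusers, dim, threshold=0.8):
        # Assumption: refusers start with zero keys, so nothing matches initially.
        self.refusers = [Refuser(key=[0.0] * dim) for _ in range(num_refusers)]
        self.threshold = threshold

    def route(self, concept):
        """Return (refuser index, "reuse" or "adapt") for a concept embedding."""
        sims = [cosine(concept, r.key) for r in self.refusers]
        best = max(range(len(sims)), key=sims.__getitem__)
        if sims[best] >= self.threshold:
            # A refuser already covers a similar concept: reuse it unchanged.
            idx, action = best, "reuse"
        else:
            # Novel concept: repurpose the least-used refuser for it.
            idx = min(range(len(self.refusers)),
                      key=lambda i: self.refusers[i].usage)
            self.refusers[idx].key = list(concept)
            action = "adapt"
        self.refusers[idx].usage += 1
        return idx, action
```

Calling `route` once per deletion request, in sequence, shows the intended behavior: requests with similar concept embeddings land on the same refuser, while a dissimilar request claims a refuser that has seen little or no use. A learned router would replace the hard threshold and the key overwrite with trainable components.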

Hyundong Jin, Dongyoon Han, Eunwoo Kim • 2026

Related benchmarks

Task	Dataset	Metric	Result
Knowledge Unlearning	16-task Sequential Unlearning Forgotten Data Avg	Context-aware Refusal Rate (CRR)	90.9	18
Knowledge Unlearning	16-task Sequential Unlearning Forgotten Data Last	Context-aware Refusal Rate (CRR)	84.54	16
Knowledge Retention	Sequential Unlearning 16-task Retained Data Avg	Specificity	95.09	9
Knowledge Retention	16-task Sequential Unlearning Retained Data Last	Specificity	96.54	8

