Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

About

Concept unlearning has emerged as a promising direction for reducing the risks of harmful content generation in text-to-image diffusion models by selectively erasing undesirable concepts from a model's parameters. Existing approaches typically rely on keywords to identify the target concept to be unlearned. However, we show that this keyword-based formulation is inherently limited: a visual concept is multi-dimensional, can be expressed in diverse textual forms, and often overlap with related concepts in the latent space, making keyword-only unlearning, which imprecisely indicate the target concept is brittle and prone to over-forgetting. This occurs because a single keyword represents only a narrow point estimate of the concept, failing to cover its full semantic distribution and entangled variations in the latent space. To address this limitation, we propose Diversified Unlearning, a distributional framework that represents a concept through a set of contextually diverse prompts rather than a single keyword. This richer representation enables more precise and robust unlearning. Through extensive experiments across multiple benchmarks and state-of-the-art baselines, we demonstrate that integrating Diversified Unlearning as an add-on component into existing unlearning pipelines consistently achieves stronger erasure, better retention of unrelated concepts, and improved robustness against adversarial recovery attacks.

Duc Hao Pham, Van Duy Truong, Duy Khanh Dinh, Tien Cuong Nguyen, Dien Hy Ngo, Tuan Anh Bui• 2026

Related benchmarks

TaskDatasetResultRank
Nudity ErasureI2P
Total Count244
38
Content PreservationMS-COCO (30K)
FID13.82
11
Object ErasureFive Objects prompts
ESR (1 Object)98.8
11
Object ErasureCassette Player prompts
ESR (k=1)100
11
Object ErasureGrabage Truck prompts
ESR (1)100
11
Artistic Style ErasureKelly McKernan To Erase
CLIP-t Score33.29
10
Celebrity Preservation15 Celebrities Preservation Benchmark (1,500 prompts)
CLIP-i85.91
10
Concept ErasureMario erasure benchmark 1,000-image
CLIP-i Score71.97
10
Concept Preservation10 Other Copyrighted Characters 1,000 prompts
LPIPS0.27
10
Artistic Style ErasureKelly McKernan To Preserve
CLIP-t29.73
10
Showing 10 of 18 rows

Other info

Follow for update