Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models

About

Existing unlearning algorithms in text-to-image generative models often fail to preserve the knowledge of semantically related concepts when removing specific target concepts: a challenge known as adjacency. To address this, we propose FADE (Fine grained Attenuation for Diffusion Erasure), introducing adjacency aware unlearning in diffusion models. FADE comprises two components: (1) the Concept Neighborhood, which identifies an adjacency set of related concepts, and (2) Mesh Modules, employing a structured combination of Expungement, Adjacency, and Guidance loss components. These enable precise erasure of target concepts while preserving fidelity across related and unrelated concepts. Evaluated on datasets like Stanford Dogs, Oxford Flowers, CUB, I2P, Imagenette, and ImageNet1k, FADE effectively removes target concepts with minimal impact on correlated concepts, achieving atleast a 12% improvement in retention performance over state-of-the-art methods.

Kartik Thakral, Tamar Glaser, Tal Hassner, Mayank Vatsa, Richa Singh• 2025

Related benchmarks

TaskDatasetResultRank
Coarse-grained UnlearningImagenette
Atar1.6
70
Concept ErasureCUB (test)
Aer100
21
Concept ErasureOxford Flowers (test)
Aer100
21
Concept ErasureStanford Dogs (test)
Aer100
21
Concept UnlearningStanford Dogs, Oxford Flowers, and CUB (Human Evaluation)
Aer51.94
6
Fine-grained Machine UnlearningImageNet-1k 1.0
ERB (Golf Ball)96.82
6
Showing 6 of 6 rows

Other info

Code

Follow for update