Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MACE: Mass Concept Erasure in Diffusion Models

About

The rapid expansion of large-scale text-to-image diffusion models has raised growing concerns regarding their potential misuse in creating harmful or misleading content. In this paper, we introduce MACE, a finetuning framework for the task of mass concept erasure. This task aims to prevent models from generating images that embody unwanted concepts when prompted. Existing concept erasure methods are typically restricted to handling fewer than five concepts simultaneously and struggle to find a balance between erasing concept synonyms (generality) and maintaining unrelated concepts (specificity). In contrast, MACE differs by successfully scaling the erasure scope up to 100 concepts and by achieving an effective balance between generality and specificity. This is achieved by leveraging closed-form cross-attention refinement along with LoRA finetuning, collectively eliminating the information of undesirable concepts. Furthermore, MACE integrates multiple LoRAs without mutual interference. We conduct extensive evaluations of MACE against prior methods across four different tasks: object erasure, celebrity erasure, explicit content erasure, and artistic style erasure. Our results reveal that MACE surpasses prior methods in all evaluated tasks. Code is available at https://github.com/Shilin-LU/MACE.

Shilin Lu, Zilan Wang, Leyang Li, Yanzhu Liu, Adams Wai-Kin Kong• 2024

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationMS-COCO
FID18.36
75
Continual Concept Learning10 Sequential Concepts (test)
UA99
70
Text-to-Image GenerationMSCOCO 30K
FID12.71
42
Art Style ErasureArtist style and content prompts 5 groups SD v1.4 based (test)
CS Style28.452
40
Concept UnlearningUnlearnDiffAtk
UnlearnDiffAtk0.176
36
Explicit Content RemovalI2P
Armpits Count17
28
Image GenerationMS-COCO 30k (val)
FID13.42
22
Concept UnlearningRing-a-Bell
Ring-A-Bell Score7.6
20
Safe Text-to-Image GenerationMMA-Diffusion
Automatic Safety Rate18.3
20
Machine UnlearningImagenette
Accuracy (garbage truck)76.7
18
Showing 10 of 79 rows
...

Other info

Code

Follow for update