Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

About

Concept erasure in text-to-image diffusion models aims to disable pre-trained diffusion models from generating images related to a target concept. To perform reliable concept erasure, the properties of robustness and locality are desirable. The former refrains the model from producing images associated with the target concept for any paraphrased or learned prompts, while the latter preserves its ability in generating images with non-target concepts. In this paper, we propose Reliable Concept Erasing via Lightweight Erasers (Receler). It learns a lightweight Eraser to perform concept erasing while satisfying the above desirable properties through the proposed concept-localized regularization and adversarial prompt learning scheme. Experiments with various concepts verify the superiority of Receler over previous methods.

Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang• 2023

Related benchmarks

TaskDatasetResultRank
Coarse-grained UnlearningImagenette
Atar47.88
70
Concept ErasureStanford Dogs (test)
Aer100
21
Concept ErasureOxford Flowers (test)
Aer100
21
Concept ErasureCUB (test)
Aer100
21
Concept Unlearning (NSFW)IGMU (standard evaluation)
FSR99.25
12
Concept Unlearning PreservationNSFW
CSDR10.2
12
Common RobustnessI2P
ASR6.81
12
Common RobustnessMMA
ASR27.57
12
Adversarial RobustnessNSFW
ASR46.88
11
Concept Unlearning PreservationChurch
CSDR9.41
9
Showing 10 of 19 rows

Other info

Follow for update