Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

About

Concept erasure in text-to-image diffusion models aims to prevent pre-trained diffusion models from generating images related to a target concept. Reliable concept erasure calls for two properties: robustness and locality. Robustness keeps the model from producing images associated with the target concept for any paraphrased or learned prompt, while locality preserves its ability to generate images of non-target concepts. In this paper, we propose Reliable Concept Erasing via Lightweight Erasers (Receler). It learns a lightweight Eraser to perform concept erasing while satisfying both properties through the proposed concept-localized regularization and adversarial prompt learning scheme. Experiments with various concepts verify the superiority of Receler over previous methods.

Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang • 2023

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	MS-COCO	FID 70.1	193
Coarse-grained Unlearning	Imagenette	Atar 47.88	70
Concept Erasure	Van Gogh style	FID 169.6	39
Nudity Detection	I2P	Breast (F) Detections 13	29
Style Erasure	Monet	Contrastive Similarity (CS) 26.92	28
Style Erasure	Picasso	Contrastive Similarity (CS) 26.16	28
Style Erasure	MS-COCO	CS Score 25.99	28
Style Erasure	Caravaggio	CS 25.34	28
Style Erasure	Paul Gauguin	CS 26.51	28
Concept Erasure	I2P	I2P Success Rate 13	23

Showing 10 of 68 rows
