Ablating Concepts in Text-to-Image Diffusion Models

About

Large-scale text-to-image diffusion models can generate high-fidelity images with powerful compositional ability. However, these models are typically trained on an enormous amount of Internet data, often containing copyrighted material, licensed images, and personal photos. Furthermore, they have been found to replicate the style of various living artists or memorize exact training samples. How can we remove such copyrighted concepts or images without retraining the model from scratch? To achieve this goal, we propose an efficient method of ablating concepts in the pretrained model, i.e., preventing the generation of a target concept. Our algorithm learns to match the image distribution for a target style, instance, or text prompt we wish to ablate to the distribution corresponding to an anchor concept. This prevents the model from generating target concepts given its text condition. Extensive experiments show that our method can successfully prevent the generation of the ablated concept while preserving closely related concepts in the model.

Nupur Kumari, Bingliang Zhang, Sheng-Yu Wang, Eli Shechtman, Richard Zhang, Jun-Yan Zhu• 2023

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	MS-COCO	FID22.63	145
Text-to-Image Generation	MS-COCO (30K)	FID (30K)21.55	72
Coarse-grained Unlearning	Imagenette	Atar100	70
Object Erasure	CIFAR-10	Accuracy (Erase)99.55	62
Text-to-Image Alignment	MS-COCO	CLIP Score31.58	60
Text-to-Image Generation	MSCOCO 30K	FID14.08	54
Explicit Content Removal	I2P	Buttocks Count10	47
Nudity Erasure	I2P	Total Count390	44
Concept Erasure	Van Gogh style	FID17.5	39
Concept Unlearning	UnlearnDiffAtk	UnlearnDiffAtk0.4296	36

Showing 10 of 185 rows

...

Other info

Follow for update

@wizwand_team Discord