Targeted Attack Improves Protection against Unauthorized Diffusion Customization

About

Diffusion models set a new milestone for image generation, yet they raise public concerns because they can be fine-tuned on unauthorized images for customization. Protection based on adversarial attacks has emerged to counter this unauthorized diffusion customization by adding protective watermarks to images and poisoning diffusion models. However, current protection, which relies on untargeted attacks, does not appear to be effective enough. In this paper, we propose a simple yet effective improvement to the protection against unauthorized diffusion customization by introducing targeted attacks. We show that, with a carefully selected target, targeted attacks significantly outperform untargeted attacks in poisoning diffusion models and degrading the quality of customized images. Extensive experiments validate the superiority of our method over existing protections on two mainstream customization methods for diffusion models. To explain the surprising success of targeted attacks, we delve into the mechanism of attack-based protections and propose a hypothesis based on our observations, which improves the understanding of attack-based protections. To the best of our knowledge, we are the first both to reveal the vulnerability of diffusion models to targeted attacks and to leverage targeted attacks to enhance protection against unauthorized diffusion customization. Our code is available on GitHub: https://github.com/psyker-team/mist-v2.
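The difference between the untargeted and targeted objectives mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' method or the mist-v2 code: a small linear map stands in for the diffusion model, and every name here (`encode`, `pgd`, `W`, `target`, the step size and budget) is a hypothetical choice made for illustration. The untargeted variant pushes the perturbed input's features away from the clean features, while the targeted variant pulls them toward a chosen target, both within the same L-infinity budget.

```python
import random

def encode(W, x):
    """Toy linear 'encoder' standing in for a model: returns W @ x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def grad_sq_dist(W, x, ref):
    """Gradient of ||W x - ref||^2 with respect to x: 2 W^T (W x - ref)."""
    r = [p - q for p, q in zip(encode(W, x), ref)]
    return [2 * sum(W[i][j] * r[i] for i in range(len(W))) for j in range(len(x))]

def sign(v):
    return (v > 0) - (v < 0)

def pgd(W, x, ref, eps, step, iters, targeted, seed=0):
    """Sign-gradient PGD inside an L-infinity ball of radius eps around x.

    targeted=True  : descend on ||W x' - ref||^2 (ref is the target feature).
    targeted=False : ascend on ||W x' - ref||^2 (ref is the clean feature).
    """
    rng = random.Random(seed)
    # Random start inside the budget; at x itself the untargeted gradient is zero.
    adv = [v + rng.uniform(-eps, eps) for v in x]
    direction = -1 if targeted else 1
    for _ in range(iters):
        g = grad_sq_dist(W, adv, ref)
        adv = [a + direction * step * sign(gj) for a, gj in zip(adv, g)]
        # Project back into the allowed perturbation budget.
        adv = [min(max(a, v - eps), v + eps) for a, v in zip(adv, x)]
    return adv

# Hypothetical toy data: a 2x2 "model", a clean input, and a chosen target feature.
W = [[1.0, 0.5], [0.0, 1.0]]
x = [0.2, 0.4]
clean = encode(W, x)
target = [3.0, 2.0]

x_targeted = pgd(W, x, target, eps=0.1, step=0.02, iters=20, targeted=True)
x_untargeted = pgd(W, x, clean, eps=0.1, step=0.02, iters=20, targeted=False)
```

In this sketch the only difference between the two modes is the reference vector and the sign of the update, which is why the choice of target becomes the main design decision in the targeted case.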

Boyang Zheng, Chumeng Liang, Xiaoyu Wu • 2023

Related benchmarks

Task	Dataset	Result
Mimicry Defense	TI-Dataset	FID 32.24	99
Training-based Mimicry Defense	TI-Dataset, DB-Dataset, CelebA-HQ, VGGFace2, WikiArt Average	FID 97.04	52
Identity Protection	CelebA (test)	ISM 74.5	48
Personalized Image Generation Protection	CelebA-HQ	FDSR 1	46
Identity Protection	VGG-Face (test)	FDR 0.988	32
Facial Privacy Protection	VggFace2	FDSR 57.3	28
Inference-based Mimicry Defense	TI-Dataset, DB-Dataset, CelebA-HQ, VGGFace2, WikiArt Average across 5 datasets	FID 47.87	26
Image Immunization	StableDiffusion 1.4 (test)	PSNR 16.05	20
Image Immunization	HQ-Edit (Unseen Prompts)	PSNR (dB) 9.32	16
Immunization against image editing	SD14 to SD3 Cross-model transfer	PSNR 21.24	16

Showing 10 of 36 rows
