Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Targeted Attack Improves Protection against Unauthorized Diffusion Customization

About

Diffusion models build a new milestone for image generation yet raising public concerns, for they can be fine-tuned on unauthorized images for customization. Protection based on adversarial attacks rises to encounter this unauthorized diffusion customization, by adding protective watermarks to images and poisoning diffusion models. However, current protection, leveraging untargeted attacks, does not appear to be effective enough. In this paper, we propose a simple yet effective improvement for the protection against unauthorized diffusion customization by introducing targeted attacks. We show that by carefully selecting the target, targeted attacks significantly outperform untargeted attacks in poisoning diffusion models and degrading the customization image quality. Extensive experiments validate the superiority of our method on two mainstream customization methods of diffusion models, compared to existing protections. To explain the surprising success of targeted attacks, we delve into the mechanism of attack-based protections and propose a hypothesis based on our observation, which enhances the comprehension of attack-based protections. To the best of our knowledge, we are the first to both reveal the vulnerability of diffusion models to targeted attacks and leverage targeted attacks to enhance protection against unauthorized diffusion customization. Our code is available on GitHub: https://github.com/psyker-team/mist-v2.

Boyang Zheng, Chumeng Liang, Xiaoyu Wu• 2023

Related benchmarks

TaskDatasetResultRank
Identity ProtectionCelebA (test)
ISM74.5
48
Identity ProtectionVGG-Face (test)
FDR0.988
32
Image ImmunizationStableDiffusion 1.4 (test)
PSNR16.05
20
Image ImmunizationHQ-Edit (Unseen Prompts)
PSNR (dB)9.32
16
Immunization against image editingSD14 to SD3 Cross-model transfer
PSNR21.24
16
Image ImmunizationInstructPix2Pix Original Prompt
PSNR17.01
16
Image ImmunizationInstructPix2Pix (Unseen Prompts)
PSNR16.42
16
Image ImmunizationStableDiffusion v3 (test)
PSNR20.94
16
Image ImmunizationInstructPix2Pix INS (test)
PSNR16.12
10
Immunization against image editingSD14 to InstructPix2Pix Cross-model transfer
PSNR16.68
10
Showing 10 of 16 rows

Other info

Follow for update