Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dual Attention Guided Defense Against Malicious Edits

About

Recent progress in text-to-image diffusion models has transformed image editing via text prompts, yet this also introduces significant ethical challenges from potential misuse in creating deceptive or harmful content. While current defenses seek to mitigate this risk by embedding imperceptible perturbations, their effectiveness is limited against malicious tampering. To address this issue, we propose a Dual Attention-Guided Noise Perturbation (DANP) immunization method that adds imperceptible perturbations to disrupt the model's semantic understanding and generation process. DANP functions over multiple timesteps to manipulate both cross-attention maps and the noise prediction process, using a dynamic threshold to generate masks that identify text-relevant and irrelevant regions. It then reduces attention in relevant areas while increasing it in irrelevant ones, thereby misguides the edit towards incorrect regions and preserves the intended targets. Additionally, our method maximizes the discrepancy between the injected noise and the model's predicted noise to further interfere with the generation. By targeting both attention and noise prediction mechanisms, DANP exhibits impressive immunity against malicious edits, and extensive experiments confirm that our method achieves state-of-the-art performance.

Jie Zhang, Shuai Dong, Shiguang Shan, Xilin Chen• 2025

Related benchmarks

TaskDatasetResultRank
Image ImmunizationHQ-Edit (Unseen Prompts)
PSNR (dB)9.1
16
Image ImmunizationInstructPix2Pix Original Prompt
PSNR14.67
16
Image ImmunizationInstructPix2Pix (Unseen Prompts)
PSNR14.99
16
Image Editing ImmunizationStableDiffusion Original Prompt v1.4
PSNR14.63
8
Image Editing ImmunizationStableDiffusion v1.4 (Unseen Prompts)
PSNR15.53
8
Image ImmunizationHQ-Edit Original Prompt
PSNR8.77
8
Showing 6 of 6 rows

Other info

Follow for update