Diffusion Models for Imperceptible and Transferable Adversarial Attack

About

Many existing adversarial attacks generate $L_p$-norm perturbations on image RGB space. Despite some achievements in transferability and attack success rate, the crafted adversarial examples are easily perceived by human eyes. Towards visual imperceptibility, some recent works explore unrestricted attacks without $L_p$-norm constraints, yet lacking transferability of attacking black-box models. In this work, we propose a novel imperceptible and transferable attack by leveraging both the generative and discriminative power of diffusion models. Specifically, instead of direct manipulation in pixel space, we craft perturbations in the latent space of diffusion models. Combined with well-designed content-preserving structures, we can generate human-insensitive perturbations embedded with semantic clues. For better transferability, we further "deceive" the diffusion model which can be viewed as an implicit recognition surrogate, by distracting its attention away from the target regions. To our knowledge, our proposed method, DiffAttack, is the first that introduces diffusion models into the adversarial attack field. Extensive experiments on various model structures, datasets, and defense methods have demonstrated the superiority of our attack over the existing attack methods.

Jianqi Chen, Hao Chen, Keyan Chen, Yilan Zhang, Zhengxia Zou, Zhenwei Shi• 2023

Related benchmarks

Task	Dataset	Result
Adversarial Attack Transferability	CIFAR-100	Average ASR81.35	138
3D Human Pose and Shape Estimation	3DPW	--	74
Adversarial Attack	CIFAR-100	ASR (Average)81.35	56
Untargeted white-box adversarial attack	ImageNet	ASR97.8	40
Adversarial Attack	CIFAR-10	--	32
Adversarial Attack	ImageNet (val)	Attack Success Rate (ResNet-50)92.5	28
Adversarial Attack	ImageNet 1,000 image subset (val)	ASR (AT)54	24
Adversarial Attack	ImageNet	ASR (RN50)92.7	24
Adversarial Image Quality Assessment	ImageNet (test)	PSNR23.31	24
Vehicle Detection	LINZ	AP5098	12

Showing 10 of 21 rows

Other info

Follow for update

@wizwand_team Discord