Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability

About

Neural networks are known to be susceptible to adversarial samples: small variations of natural examples crafted to deliberately mislead the models. While they can be easily generated using gradient-based techniques in digital and physical scenarios, they often differ greatly from the actual data distribution of natural images, resulting in a trade-off between strength and stealthiness. In this paper, we propose a novel framework dubbed Diffusion-Based Projected Gradient Descent (Diff-PGD) for generating realistic adversarial samples. By exploiting a gradient guided by a diffusion model, Diff-PGD ensures that adversarial samples remain close to the original data distribution while maintaining their effectiveness. Moreover, our framework can be easily customized for specific tasks such as digital attacks, physical-world attacks, and style-based attacks. Compared with existing methods for generating natural-style adversarial samples, our framework enables the separation of optimizing adversarial loss from other surrogate losses (e.g., content/smoothness/style loss), making it more stable and controllable. Finally, we demonstrate that the samples generated using Diff-PGD have better transferability and anti-purification power than traditional gradient-based methods. Code will be released in https://github.com/xavihart/Diff-PGD

Haotian Xue, Alexandre Araujo, Bin Hu, Yongxin Chen• 2023

Related benchmarks

Task	Dataset	Result
Adversarial Attack	CIFAR-100	ASR (Average)81.8	56
Untargeted white-box adversarial attack	ImageNet	ASR92.1	40
Adversarial Attack	CIFAR-10	--	32
Camouflage attack on Object Detection	COCO	AP5068.5	16
Object Detection	LINZ (test)	AP5073.2	16
Untargeted white-box attack	Target Model: Vgg-19	Latency (s)5.77e+3	10
Untargeted white-box attack	Target Model: WideResNet-50	Time (s)5.97e+3	10
Untargeted white-box attack	Target Model: MobileNet-V2	Attack Time (s)5.94e+3	10
Adversarial Attack	ImageNet-1K	Time (s)10.4	8

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord