Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses

About

Advances in the development of adversarial attacks have been fundamental to the progress of adversarial defense research. Efficient and effective attacks are crucial for reliable evaluation of defenses, and also for developing robust models. Adversarial attacks are often generated by maximizing standard losses such as the cross-entropy loss or maximum-margin loss within a constraint set using Projected Gradient Descent (PGD). In this work, we introduce a relaxation term to the standard loss, that finds more suitable gradient-directions, increases attack efficacy and leads to more efficient adversarial training. We propose Guided Adversarial Margin Attack (GAMA), which utilizes function mapping of the clean image to guide the generation of adversaries, thereby resulting in stronger attacks. We evaluate our attack against multiple defenses and show improved performance when compared to existing attacks. Further, we propose Guided Adversarial Training (GAT), which achieves state-of-the-art performance amongst single-step defenses by utilizing the proposed relaxation term for both attack generation and training.

Gaurang Sriramanan, Sravanti Addepalli, Arya Baburaj, R. Venkatesh Babu• 2020

Related benchmarks

Task	Dataset	Result
Adversarial Robustness	CIFAR-10 (test)	--	76
Adversarial Robustness	CIFAR-100 (test)	--	46
Image Classification	CIFAR-10 (test)	Accuracy81.64	31
Adversarial Robustness	CIFAR-10	FGSM Robust Accuracy64.3	30
Adversarial Image Classification	CIFAR-100	Clean Accuracy57.58	24
Image Classification	CIFAR-100 WRN34-10 (test)	SA Success Rate65.71	22
Image Classification	CIFAR-100 (test)	SA59.92	22
Image Classification	Tiny ImageNet (test)	Standard Accuracy49.75	22
Image Classification	Tiny-ImageNet	Clean Accuracy46	22
Image Classification	CIFAR10 (test)	Accuracy (Natural)82.17	21

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord