Adversarial Training for Free!

About

Adversarial training, in which a network is trained on adversarial examples, is one of the few defenses against adversarial attacks that withstands strong attacks. Unfortunately, the high cost of generating strong adversarial examples makes standard adversarial training impractical on large-scale problems like ImageNet. We present an algorithm that eliminates the overhead cost of generating adversarial examples by recycling the gradient information computed when updating model parameters. Our "free" adversarial training algorithm achieves comparable robustness to PGD adversarial training on the CIFAR-10 and CIFAR-100 datasets at negligible additional cost compared to natural training, and can be 7 to 30 times faster than other strong adversarial training methods. Using a single workstation with 4 P100 GPUs and 2 days of runtime, we can train a robust model for the large-scale ImageNet classification task that maintains 40% accuracy against PGD attacks. The code is available at https://github.com/ashafahi/free_adv_train.

Ali Shafahi, Mahyar Najibi, Amin Ghiasi, Zheng Xu, John Dickerson, Christoph Studer, Larry S. Davis, Gavin Taylor, Tom Goldstein• 2019

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10 (test)	Accuracy86.11	3381
Natural Language Understanding	GLUE (dev)	SST-2 (Acc)96.1	529
Image Classification	CUB-200	Accuracy63.67	117
Image Classification	CIFAR-100	Clean Accuracy53.23	90
Image Classification	CIFAR-10	Clean Accuracy77.61	89
Adversarial Robustness	CIFAR-10 (test)	--	76
Image Classification	PathMNIST	Clean Accuracy78.32	60
Image Classification	CIFAR-10 (test)	Natural Accuracy76.52	48
Medical Image Classification	PathMNIST	Clean Accuracy78.32	48
Adversarial Robustness	CIFAR-100 (test)	--	46

Showing 10 of 31 rows

Other info

Code

Follow for update

@wizwand_team Discord