Theoretically Principled Trade-off between Robustness and Accuracy
About
We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples. Although this problem has been widely studied empirically, much remains unknown about the theory underlying this trade-off. In this work, we decompose the prediction error on adversarial examples (the robust error) as the sum of the natural (classification) error and the boundary error, and provide a differentiable upper bound using the theory of classification-calibrated losses, which we show is the tightest possible upper bound uniform over all probability distributions and measurable predictors. Inspired by our theoretical analysis, we also design a new defense method, TRADES, that trades adversarial robustness off against accuracy. Our proposed algorithm performs well experimentally on real-world datasets. The methodology formed the foundation of our entry to the NeurIPS 2018 Adversarial Vision Challenge, in which we won first place out of ~2,000 submissions, surpassing the runner-up approach by $11.41\%$ in terms of mean $\ell_2$ perturbation distance.
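The decomposition above suggests a training objective with two terms: a standard classification loss on natural inputs (for accuracy) plus a regularizer that pushes the decision boundary away from the data (for robustness). As a minimal sketch of that surrogate, the snippet below combines cross-entropy on clean logits with a KL-divergence term between the model's predictions on clean and perturbed inputs, weighted by a trade-off coefficient `beta`; the function name, the NumPy setting, and the KL direction are our illustrative assumptions, not the authors' reference implementation (which performs the inner maximization over perturbations with projected gradient steps).

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the last axis.
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def trades_loss(logits_nat, logits_adv, labels, beta=6.0):
    """Sketch of a TRADES-style surrogate on a batch:
    cross-entropy on natural inputs (accuracy term)
    + beta * KL(p_nat || p_adv)   (boundary / robustness term).
    `logits_adv` stands in for model outputs on adversarially
    perturbed inputs found by an inner maximization (not shown).
    """
    p_nat = softmax(logits_nat)
    p_adv = softmax(logits_adv)
    n = len(labels)
    # Accuracy term: negative log-likelihood of the true labels.
    ce = -np.log(p_nat[np.arange(n), labels] + 1e-12).mean()
    # Robustness term: divergence between clean and perturbed predictions.
    kl = (p_nat * (np.log(p_nat + 1e-12)
                   - np.log(p_adv + 1e-12))).sum(axis=-1).mean()
    return ce + beta * kl
```

Setting `beta = 0` recovers plain empirical risk minimization, while larger `beta` trades natural accuracy for robustness, mirroring the trade-off the analysis formalizes.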
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Image Classification | CIFAR-100 (test) | Accuracy | 62.37 | 3518 |
| Image Classification | CIFAR-10 (test) | Accuracy | 88.64 | 3381 |
| Image Classification | MNIST | Accuracy | 99.4 | 395 |
| Image Classification | TinyImageNet (test) | Accuracy | 38.51 | 366 |
| Image Classification | CIFAR-10 (test) | Accuracy (Clean) | 85.9 | 273 |
| Image Classification | Fashion MNIST | Accuracy | 78.82 | 225 |
| Image Classification | CIFAR10 (train) | Accuracy | 98.98 | 90 |
| Image Classification | GTSRB | Natural Accuracy | 72.3 | 87 |
| Adversarial Robustness | CIFAR-10 (test) | -- | -- | 76 |
| Image Classification | Tiny-ImageNet 1.0 (test) | Accuracy (Natural) | 60.8 | 75 |