Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Do Adversarially Robust ImageNet Models Transfer Better?

About

Transfer learning is a widely-used paradigm in deep learning, where models pre-trained on standard datasets can be efficiently adapted to downstream tasks. Typically, better pre-trained models yield better transfer results, suggesting that initial accuracy is a key aspect of transfer learning performance. In this work, we identify another such aspect: we find that adversarially robust models, while less accurate, often perform better than their standard-trained counterparts when used for transfer learning. Specifically, we focus on adversarially robust ImageNet classifiers, and show that they yield improved accuracy on a standard suite of downstream classification tasks. Further analysis uncovers more differences between robust and standard models in the context of transfer learning. Our results are consistent with (and in fact, add to) recent hypotheses stating that robustness leads to improved feature representations. Our code and models are available at https://github.com/Microsoft/robust-models-transfer .

Hadi Salman, Andrew Ilyas, Logan Engstrom, Ashish Kapoor, Aleksander Madry• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)--
1453
Image ClassificationCIFAR-100 standard (test)
Top-1 Accuracy84.21
133
Image ClassificationCIFAR-10 standard (test)
Accuracy97.41
97
Image ClassificationImageNet RobustBench (val)
Clean Accuracy68.82
36
Adversarial AttackImageNet
Parsimon52.08
19
Adversarial AttackImageNet
Parsimon58.6
19
Image ClassificationCIFAR-10 (test)
SA Score84.73
16
Adversarial Image ClassificationCIFAR-100 (test)
SA54.81
16
Image ClassificationImageNet (test)
Standard Accuracy63.86
7
Image ClassificationRestricted ImageNet (test)
Clean Accuracy86.72
6
Showing 10 of 12 rows

Other info

Code

Follow for update