Benchmarking Neural Network Robustness to Common Corruptions and Perturbations

About

In this paper we establish rigorous benchmarks for image classifier robustness. Our first benchmark, ImageNet-C, standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications. Then we propose a new dataset called ImageNet-P which enables researchers to benchmark a classifier's robustness to common perturbations. Unlike recent robustness research, this benchmark evaluates performance on common corruptions and perturbations not worst-case adversarial perturbations. We find that there are negligible changes in relative corruption robustness from AlexNet classifiers to ResNet classifiers. Afterward we discover ways to enhance corruption and perturbation robustness. We even find that a bypassed adversarial defense provides substantial common perturbation robustness. Together our benchmarks may aid future work toward networks that robustly generalize.

Dan Hendrycks, Thomas Dietterich• 2019

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10-C	Accuracy50.8	179
Image Classification	CIFAR-100-C	Accuracy (Corruption)32	109
Semantic segmentation	ACDC (test)	mIoU49.21	103
Inference Efficiency	MS-COCO	Sequence Length Delta-18.33	20
Inference Efficiency	ImageNet-1K	Inference Length-4.27	20
Efficiency Reduction	Subject B	Iteration Loop Count0.04	14
Efficiency Reduction	Subject D	Iteration Count-8.35	14
Efficiency Reduction	Subject C	Loop Count-2.32	14
Efficiency Reduction	Subject A	Loop Count (I)0.47	14
Image Classification	MNIST (test)	Clean Error Rate2.1	12

Showing 10 of 23 rows

Other info

Code

Follow for update

@wizwand_team Discord