
An Orthogonal Classifier for Improving the Adversarial Robustness of Neural Networks

About

Neural networks are susceptible to artificially designed adversarial perturbations. Recent efforts have shown that imposing certain modifications on the classification layer can improve the robustness of neural networks. In this paper, we explicitly construct a dense orthogonal weight matrix whose entries have the same magnitude, thereby leading to a novel robust classifier. The proposed classifier avoids the undesired structural redundancy issue in previous work. Applying this classifier in standard training on clean data is sufficient to ensure high accuracy and good robustness of the model. Moreover, when extra adversarial samples are used, robustness can be further improved with the help of a special worst-case loss. Experimental results show that our method is efficient and competitive with many state-of-the-art defensive approaches. Our code is available at https://github.com/MTandHJ/roboc.

Cong Xu, Xiang Li, Min Yang • 2021
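
To make the abstract's "dense orthogonal weight matrix whose entries have the same magnitude" concrete, here is a minimal sketch of one well-known family with exactly those two properties: a Hadamard matrix scaled by 1/sqrt(n). Sylvester's doubling construction, the function names, the dimension 16, and the idea of using the rows as fixed class vectors are all assumptions made for illustration. The abstract does not say which construction the authors use; see their repository for the actual method.

```python
import math


def sylvester_hadamard(n):
    """Return an n x n matrix with entries +1/-1 and mutually orthogonal rows.

    Uses Sylvester's doubling construction, so n must be a power of two.
    This is an illustrative choice, not necessarily the paper's construction.
    """
    if n < 1 or n & (n - 1) != 0:
        raise ValueError("n must be a positive power of two")
    h = [[1]]
    while len(h) < n:
        # Doubling step: H -> [[H, H], [H, -H]]
        top = [row + row for row in h]
        bottom = [row + [-x for x in row] for row in h]
        h = top + bottom
    return h


def orthogonal_classifier_weights(dim):
    """Scale the +1/-1 matrix by 1/sqrt(dim) so that its rows are orthonormal."""
    scale = 1.0 / math.sqrt(dim)
    return [[scale * x for x in row] for row in sylvester_hadamard(dim)]


def logits(features, weights, num_classes):
    """Score a feature vector against the first num_classes weight rows.

    Assumption: each row acts as a fixed class vector, one row per class.
    """
    return [
        sum(w * f for w, f in zip(weights[k], features))
        for k in range(num_classes)
    ]


# Hypothetical sizes: 16-dimensional features, 10 classes.
W = orthogonal_classifier_weights(16)
scores = logits(W[3], W, num_classes=10)
predicted = max(range(len(scores)), key=scores.__getitem__)
```

Because the rows are orthonormal, a feature vector equal to one class vector scores 1 on that class and 0 on every other class, and no weight entry is zero, which is what "dense" refers to in this sketch.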

Related benchmarks

Task | Dataset | Result | Rank
Classification | FashionMNIST (l1, epsilon=10) (test) | Clean Accuracy: 92.56 | 8
Classification | FashionMNIST (l2, epsilon=2) (test) | PGD (100) Robust Accuracy: 82.91 | 8
Classification | FashionMNIST (linf, epsilon=0.3) (test) | Robust Accuracy (FGSM): 88.44 | 8
Image Classification | CIFAR-10 | Clean Accuracy: 92.62 | 8
Image Classification | CIFAR-10 (test) | Accuracy (Clean): 92.62 | 8