Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Efficient local linearity regularization to overcome catastrophic overfitting

About

Catastrophic overfitting (CO) in single-step adversarial training (AT) results in abrupt drops in the adversarial test accuracy (even down to 0%). For models trained with multi-step AT, it has been observed that the loss function behaves locally linearly with respect to the input, this is however lost in single-step AT. To address CO in single-step AT, several methods have been proposed to enforce local linearity of the loss via regularization. However, these regularization terms considerably slow down training due to Double Backpropagation. Instead, in this work, we introduce a regularization term, called ELLE, to mitigate CO effectively and efficiently in classical AT evaluations, as well as some more difficult regimes, e.g., large adversarial perturbations and long training schedules. Our regularization term can be theoretically linked to curvature of the loss function and is computationally cheaper than previous methods by avoiding Double Backpropagation. Our thorough experimental validation demonstrates that our work does not suffer from CO, even in challenging settings where previous works suffer from it. We also notice that adapting our regularization parameter during training (ELLE-A) greatly improves the performance, specially in large $\epsilon$ setups. Our implementation is available in https://github.com/LIONS-EPFL/ELLE .

Elias Abad Rocamora, Fanghui Liu, Grigorios G. Chrysos, Pablo M. Olmos, Volkan Cevher• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100
Clean Accuracy57.94
90
Image ClassificationCIFAR-10
Clean Accuracy83.68
89
Adversarial RobustnessCIFAR-10 (test)--
76
Image ClassificationPathMNIST
Clean Accuracy79.8
60
Medical Image ClassificationPathMNIST
Clean Accuracy79.65
48
Image ClassificationCIFAR-10 (test)
Accuracy82.14
31
Image ClassificationTinyImageNet
Clean Accuracy49.43
30
Image ClassificationTissueMNIST MedMNIST v2 (test)
Clean Accuracy60.09
29
Image ClassificationTiny ImageNet (test)
Standard Accuracy43.95
22
Image ClassificationCIFAR-100 WRN34-10 (test)
SA Success Rate55.24
22
Showing 10 of 15 rows

Other info

Follow for update