Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SORA: Free Second-Order Attacks in Fast Adversarial Training

About

Adversarial Training (AT) is a leading defense against adversarial examples but often suffers from Catastrophic Overfitting (CO) in efficient single-step variants, where robustness to multi-step attacks collapses despite high single-step performance. We address this failure mode with two contributions. First, we formalize Epsilon Overfitting (EO), a perspective in which fixed perturbation magnitudes and directions exacerbate CO, and show that introducing perturbation variability significantly improves robust generalization across different architectures and datasets. Second, we propose PertAlign (Perturbation Alignment), a theoretically grounded, computationally negligible metric that predicts CO onset by measuring gradient alignment across attack stages. Leveraging these insights, we introduce SORA, an adaptive step-size AT method that dynamically adjusts perturbations based on loss surface geometry. SORA consistently prevents CO, achieves state-of-the-art robustness and clean accuracy, and generalizes across datasets and architectures using a single fixed set of hyperparameters, which is essential for applicability in fast AT. Extensive experiments on diverse datasets and architectures show that SORA matches or surpasses the robustness of prior methods while delivering higher clean accuracy and superior efficiency. Code is available at https://github.com/SecondOrderAT/SORA.

Mazdak Teymourian, Ramtin Moslemi, Farzan Rahmani, Mohammad Hossein Rohban• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-100 (test)
Clean Accuracy57.26
123
Image ClassificationCIFAR-100
Clean Accuracy58.58
90
Image ClassificationCIFAR-10
Clean Accuracy80.17
89
Image ClassificationPathMNIST
Clean Accuracy86.53
60
Medical Image ClassificationPathMNIST
Clean Accuracy86.53
48
Image ClassificationTinyImageNet
Clean Accuracy54.59
30
Image ClassificationTissueMNIST MedMNIST v2 (test)
Clean Accuracy58.71
29
Image ClassificationTissueMNIST
Clean Accuracy60.68
20
Image ClassificationCIFAR-100 (test)
Clean Accuracy53.61
16
Image ClassificationCIFAR-10 (test)
Accuracy (Clean)83.44
16
Showing 10 of 12 rows

Other info

Follow for update