SORA: Free Second-Order Attacks in Fast Adversarial Training

About

Adversarial Training (AT) is a leading defense against adversarial examples but often suffers from Catastrophic Overfitting (CO) in efficient single-step variants, where robustness to multi-step attacks collapses despite high single-step performance. We address this failure mode with two contributions. First, we formalize Epsilon Overfitting (EO), a perspective in which fixed perturbation magnitudes and directions exacerbate CO, and show that introducing perturbation variability significantly improves robust generalization across different architectures and datasets. Second, we propose PertAlign (Perturbation Alignment), a theoretically grounded, computationally negligible metric that predicts CO onset by measuring gradient alignment across attack stages. Leveraging these insights, we introduce SORA, an adaptive step-size AT method that dynamically adjusts perturbations based on loss surface geometry. SORA consistently prevents CO, achieves state-of-the-art robustness and clean accuracy, and generalizes across datasets and architectures using a single fixed set of hyperparameters, which is essential for applicability in fast AT. Extensive experiments on diverse datasets and architectures show that SORA matches or surpasses the robustness of prior methods while delivering higher clean accuracy and superior efficiency. Code is available at https://github.com/SecondOrderAT/SORA.
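The abstract describes PertAlign only as a metric that measures gradient alignment across attack stages and does not give its formula. As a rough illustration of that general idea, and not the paper's definition, the sketch below assumes alignment is the cosine similarity between the input gradient at the clean point and the input gradient after one FGSM-style sign step. The toy two-layer tanh model and the names `cosine_alignment`, `fgsm_step`, and `input_gradient` are hypothetical and chosen only for this example.

```python
import math

def cosine_alignment(g1, g2):
    """Cosine similarity between two gradient vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(g1, g2))
    n1 = math.sqrt(sum(a * a for a in g1))
    n2 = math.sqrt(sum(b * b for b in g2))
    if n1 == 0.0 or n2 == 0.0:
        return 0.0
    return dot / (n1 * n2)

def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)

def fgsm_step(x, grad, step_size):
    """Single-step sign perturbation: x + step_size * sign(grad)."""
    return [xi + step_size * sign(gi) for xi, gi in zip(x, grad)]

def input_gradient(x, y, W, v):
    """Gradient w.r.t. x of log(1 + exp(-y * f(x))) with f(x) = v . tanh(W x).

    Toy model for illustration only (an assumption, not the paper's setup).
    """
    h = [math.tanh(sum(w * xj for w, xj in zip(row, x))) for row in W]
    f = sum(vi * hi for vi, hi in zip(v, h))
    dl_df = -y / (1.0 + math.exp(y * f))  # derivative of the logistic loss w.r.t. f
    return [
        dl_df * sum(v[i] * (1.0 - h[i] ** 2) * W[i][j] for i in range(len(W)))
        for j in range(len(x))
    ]

if __name__ == "__main__":
    # Arbitrary toy weights and input, chosen only to make the script runnable.
    W = [[1.5, -2.0], [-1.0, 0.5], [0.3, 2.2]]
    v = [1.0, -1.5, 0.8]
    x, y = [0.2, -0.1], 1
    g_clean = input_gradient(x, y, W, v)
    for step in (0.01, 0.1, 0.5):
        x_adv = fgsm_step(x, g_clean, step)
        g_adv = input_gradient(x_adv, y, W, v)
        print(step, round(cosine_alignment(g_clean, g_adv), 4))
```

Under this assumed definition, an alignment near 1 means the loss surface is close to linear along the step, so a single-step perturbation remains a reasonable proxy for a multi-step one. A falling value signals curvature, which is the kind of condition under which an adaptive method might shrink its step size. How SORA actually maps alignment to step size is not specified in the abstract.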

Mazdak Teymourian, Ramtin Moslemi, Farzan Rahmani, Mohammad Hossein Rohban • 2026

Related benchmarks

Task	Dataset	Metric	Result	
Image Classification	ImageNet-100 (test)	Clean Accuracy	57.26	189
Image Classification	CIFAR-10	Clean Accuracy	80.17	175
Image Classification	CIFAR-100	Clean Accuracy	58.58	139
Image Classification	PathMNIST	Clean Accuracy	86.53	60
Medical Image Classification	PathMNIST	Clean Accuracy	86.53	48
Image Classification	TinyImageNet	Clean Accuracy	54.59	30
Image Classification	TissueMNIST MedMNIST v2 (test)	Clean Accuracy	58.71	29
Image Classification	TissueMNIST	Clean Accuracy	60.68	20
Image Classification	CIFAR-100 (test)	Clean Accuracy	53.61	16
Image Classification	CIFAR-10 (test)	Accuracy (Clean)	83.44	16

