Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rethinking Evaluation Paradigms in IBP-based Certified Training

About

Deep neural networks achieve strong performance on many supervised learning tasks but remain vulnerable to adversarial perturbations. Neural network verification provides mathematically rigorous robustness guarantees, yet at substantial computational cost. To mitigate this, certified training techniques optimise for verifiable robustness during training, typically inducing a trade-off between natural and certified accuracy controlled by method-specific hyperparameters. Because these metrics are inherently conflicting, the common practice of reporting a single configuration is problematic: it can mislead conclusions about overall performance and prevents unbiased assessments of the state of the art. We address this by evaluating certified training methods via Pareto front comparisons over the natural--certified accuracy trade-off. To enable fair, method-agnostic comparisons, we perform efficient automated multi-objective hyperparameter optimisation to identify a set of Pareto-optimal configurations for each method. This approach often uncovers substantial undertuning in previously reported configurations, yielding superior performance and establishing a new state of the art. Leveraging these fronts, we present the first comprehensive multi-objective comparison of certified training approaches, showing that prior advancements are less pronounced than assumed and revealing previously unreported performance complementarities.

Konstantin Kaulen, Hadar Shavit, Holger H. Hoos• 2026

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR10 epsilon = 8/255 (test)
Clean Accuracy56.06
25
Certified Image ClassificationCIFAR-10 epsilon=2/255 (test)
Clean Accuracy81.96
16
Certified Image ClassificationTiny ImageNet epsilon=1/255 (test)
Clean Accuracy42.1
16
Robustness VerificationMNIST epsilon=0.3 (test)
Clean Accuracy98.8
16
Showing 4 of 4 rows

Other info

Follow for update