Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vertex-Softmax: Tight Transformer Verification via Exact Softmax Optimization

About

Certified verification of transformer attention requires bounding the softmax function over interval constraints on the pre-softmax scores. Existing verifiers relax softmax ndependently of the downstream objective, leaving avoidable slack. We prove that the exact optimum of this score-box problem is attained at a vertex of the constraint box, and establish a threshold structure theorem showing that, after sorting the objective coefficients, the optimum lies among only linearly many candidates, yielding the Vertex-Softmax primitive with log-linear complexity in the sequence length. We further prove a formal optimality result showing that Vertex-Softmax is the tightest sound bound obtainable from score intervals alone, characterizing precisely what additional structure (score correlations, score-value coupling) is needed for further improvement. Integrated into a CROWN Convex Relaxation based Optimization for Worst-case Neurons)-style verifier with a formal soundness guarantee, Vertex-Softmax significantly improves certified rates and substantially tightens lower bounds across MNIST, Fashion-MNIST, and CIFAR-10 attention models, while consistently matching or outperforming alpha-CROWN and branch-and-bound baselines at a fraction of their cost.

Navid Rezazadeh, Arash Gholami Davoodi• 2026

Related benchmarks

TaskDatasetResultRank
Robustness CertificationAttention blocks
Certification Rate74.2
12
Robustness CertificationMNIST Binary
Certified Rate97.2
10
Image ClassificationFashion MNIST
Clean Accuracy70.8
8
Neural Network VerificationSmall-attention experiments
Time per Trial (s)0.03
6
Robustness CertificationResidual-MHA blocks
Certification Score33.7
6
Robustness CertificationMNIST 10-class
Certified Rate12.9
4
Image ClassificationCIFAR-10 gray
Clean Accuracy34.6
4
Showing 7 of 7 rows

Other info

Follow for update