S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
About
Transferable Targeted Attacks (TTAs) face significant challenges due to severe overfitting to surrogate models. Recent breakthroughs heavily rely on large-scale training data of victim models, while data-free solutions, \textit{i.e.}, image transformation-involved gradient optimization, often depend on black-box feedback for method design and tuning. These dependencies violate black-box transfer settings and compromise threat evaluation fairness. In this paper, we propose two blind estimation measures, self-alignment and self-transferability, to analyze per-transformation effectiveness and cross-transformation correlations under strict black-box constraints. Our findings challenge conventional assumptions: (1) Attacking simple scaling transformations uniquely enhances targeted transferability, outperforming other basic transformations and rivaling leading complex methods; (2) Geometric and color transformations exhibit high internal redundancy despite weak inter-category correlations. These insights drive the design and tuning of S$^4$ST (Strong, Self-transferable, faSt, Simple Scale Transformation), which integrates dimensionally consistent scaling, complementary low-redundancy transformations, and block-wise operations. Extensive evaluations across diverse architectures, training distributions, and tasks show that S$^{4}$ST achieves state-of-the-art effectiveness-efficiency balance without data dependency. We reveal that scaling's effectiveness stems from visual data's multi-scale nature and ubiquitous scale augmentation during training, rendering such augmentation a double-edged sword. Further validations on medical imaging and face verification confirm the framework's strong generalization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Instance Segmentation | COCO 2017 | -- | 226 | |
| Image Classification | CXR14 | AUC0.68 | 76 | |
| Adversarial Attack | ImageNet Augmix | Targeted Success Rate99.3 | 24 | |
| Targeted Adversarial Attack | ImageNet-compatible (test) | Overall Success Rate83 | 13 | |
| Targeted Attack | ImageNet-Compatible 10-Targets (all-source) | CNX Robustness96 | 13 | |
| Object Detection | COCO 2017 | tSucDet65.3 | 12 | |
| Targeted Adversarial Attack | Google Cloud Vision API | Target 1 Success Rate54 | 4 | |
| Targeted Adversarial Attack | Azure API | T1 Success Rate89 | 4 | |
| Targeted Adversarial Attack | Baidu API | T1 Success Rate86 | 4 | |
| Targeted Adversarial Attack | GLM-4V Flash | Target 1 Score81 | 4 |