DSBA: Dynamic Stealthy Backdoor Attack with Collaborative Optimization in Self-Supervised Learning

About

Self-Supervised Learning (SSL) has emerged as a significant paradigm in representation learning thanks to its ability to learn without extensive labeled data, its strong generalization capabilities, and its potential for privacy preservation. However, recent research reveals that SSL models are also vulnerable to backdoor attacks. Existing backdoor attack methods in the SSL context commonly suffer from highly detectable triggers, feature entanglement, and pronounced out-of-distribution properties in poisoned samples, all of which compromise attack effectiveness and stealthiness. To address these issues, we propose a Dynamic Stealthy Backdoor Attack (DSBA) backed by a new technique we term Collaborative Optimization. This method decouples the attack process into two collaborative optimization layers. The outer-layer optimization trains a backdoor encoder responsible for global feature space remodeling, aiming to achieve precise backdoor implantation while preserving core functionality. The inner-layer optimization employs a dynamically optimized generator to adaptively produce optimally concealed triggers for individual samples, achieving coordinated concealment across feature space and visual space. We also introduce multiple loss functions to dynamically balance attack performance and stealthiness, together with an adaptive weight scheduling mechanism that enhances training stability. Extensive experiments on various mainstream SSL algorithms and five public datasets demonstrate that: (i) DSBA significantly enhances Attack Success Rate (ASR) and stealthiness while maintaining downstream task accuracy; and (ii) DSBA exhibits superior robustness against existing mainstream defense methods.
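The abstract reports results in terms of Attack Success Rate (ASR) and downstream task accuracy. As a minimal sketch of how these two quantities are commonly computed for a backdoored classifier, the Python below may help. The function names, the toy labels, and the convention of excluding samples whose true class already equals the target are illustrative assumptions; the paper's exact evaluation protocol is not stated on this page.

```python
from typing import Sequence


def attack_success_rate(preds_on_triggered: Sequence[int],
                        true_labels: Sequence[int],
                        target_label: int) -> float:
    """Fraction of triggered, non-target-class inputs predicted as the target.

    Assumption: samples whose true class is already the target are excluded,
    since they would count as successes even without a backdoor.
    """
    if len(preds_on_triggered) != len(true_labels):
        raise ValueError("predictions and labels must have the same length")
    eligible = [p for p, y in zip(preds_on_triggered, true_labels)
                if y != target_label]
    if not eligible:
        raise ValueError("no non-target samples to evaluate")
    return sum(p == target_label for p in eligible) / len(eligible)


def clean_accuracy(preds_on_clean: Sequence[int],
                   true_labels: Sequence[int]) -> float:
    """Ordinary accuracy on clean (trigger-free) inputs."""
    if len(preds_on_clean) != len(true_labels) or len(true_labels) == 0:
        raise ValueError("predictions and labels must be non-empty and aligned")
    correct = sum(p == y for p, y in zip(preds_on_clean, true_labels))
    return correct / len(true_labels)


if __name__ == "__main__":
    # Hypothetical toy data: six test samples, attacker's target class is 0.
    labels = [0, 1, 2, 3, 1, 2]
    clean_preds = [0, 1, 2, 3, 1, 0]      # model output on clean inputs
    triggered_preds = [0, 0, 0, 3, 0, 0]  # model output on triggered inputs

    print(f"ASR: {attack_success_rate(triggered_preds, labels, 0):.3f}")
    print(f"Clean accuracy: {clean_accuracy(clean_preds, labels):.3f}")
```

An effective and stealthy attack in this sense would push the first number toward 1 while leaving the second close to that of an unpoisoned model.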

Jiayao Wang, Mohammad Maruf Hasan, Yiping Zhang, Xiaoying Lei, Jiale Zhang, Qilin Wu, Junwu Zhu, Dongfang Zhao • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Backdoor Detection | CIFAR-10 | Accuracy 47.9 | 122 |
| Backdoor Attack Stealthiness Evaluation | CIFAR10 | SSIM 0.9918 | 40 |
| Backdoor Detection | SVHN | -- | 21 |
| Image Classification | SVHN | Balanced Accuracy 72.18 | 15 |
| Image Classification | GTSRB | Balanced Accuracy (BA) 78.23 | 15 |
| Image Classification | STL-10 | Balanced Accuracy (BA) 84.15 | 10 |
| Backdoor Detection | STL-10 target attack | True Positives (TP) 4 | 9 |
| Image Classification | CIFAR-10 | Balanced Accuracy 84.15 | 5 |
| Backdoor Detection | STL-10 Downstream from CIFAR-10 Pre-training | Anomaly Index 1.05 | 1 |
| Backdoor Detection | GTSRB Downstream CIFAR-10 Pre-training | Anomaly Index 1.21 | 1 |
Showing 10 of 13 rows
