
How to Backdoor Diffusion Models?

About

Diffusion models are state-of-the-art deep generative models trained by learning forward and reverse diffusion processes through progressive noise addition and denoising. To better understand their limitations and potential risks, this paper presents the first study on the robustness of diffusion models against backdoor attacks. Specifically, we propose BadDiffusion, a novel attack framework that engineers compromised diffusion processes during model training for backdoor implantation. At the inference stage, the backdoored diffusion model behaves just like an untampered generator for regular data inputs, but generates a targeted outcome designed by the bad actor when it receives the implanted trigger signal. Such a risk can be severe for downstream tasks and applications built upon the compromised model. Our extensive experiments on various backdoor attack settings show that BadDiffusion consistently yields compromised diffusion models with high utility and target specificity. Even worse, BadDiffusion can be made cost-effective by simply fine-tuning a clean pre-trained diffusion model to implant backdoors. We also explore some possible countermeasures for risk mitigation. Our results call attention to potential risks and possible misuse of diffusion models. Our code is available at https://github.com/IBM/BadDiffusion.
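As a rough illustration of the "progressive noise-addition" the abstract mentions, the sketch below samples from the standard DDPM forward process, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps. The abstract does not describe how BadDiffusion alters this process or what its trigger looks like, so everything here is an assumption for illustration: the linear beta schedule, the function names, and the corner-patch trigger in `stamp_trigger` are hypothetical and do not reproduce the paper's training objective.

```python
import numpy as np


def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)


def forward_noise(x0, t, alpha_bar, rng):
    """Sample x_t from q(x_t | x_0) in the standard (clean) DDPM forward process."""
    eps = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return x_t, eps


def stamp_trigger(x, patch_value=1.0, size=3):
    """Hypothetical trigger: overwrite a small bottom-right patch of the image.

    This only illustrates the idea of a trigger signal added to an input;
    it is not the trigger used in the paper.
    """
    out = x.copy()
    out[..., -size:, -size:] = patch_value
    return out


rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()
x0 = np.zeros((3, 32, 32))            # placeholder image, channels first
x_t, eps = forward_noise(x0, 500, alpha_bar, rng)
x_triggered = stamp_trigger(x_t)      # noisy input carrying the hypothetical trigger
```

A denoising network would normally be trained to predict `eps` from `x_t` and `t`; a backdoor attack of the kind described above would additionally tie triggered inputs to an attacker-chosen target, a step this sketch leaves out.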

Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho • 2022

Related benchmarks

Task                         Dataset                            Result        Rank
Secret image extraction      CIFAR10 32x32                      PSNR 22.08    10
Secret image extraction      LSUN Bedroom 256x256               PSNR 17.68    10
Diffusion Backdoor Attack    CelebA-HQ                          FID 46.15     6
Diffusion Backdoor Attack    CIFAR-10                           FID 42.86     6
Neural Steganography         CIFAR10 32x32 resolution           FID 6.88      5
Neural Steganography         LSUN Bedroom 256x256 resolution    FID 15.75     5
Diffusion Backdoor Attack    CIFAR-100                          FID 55.27     3
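
For readers unfamiliar with the PSNR figures above, the following is a minimal sketch of the standard peak signal-to-noise ratio, PSNR = 10 * log10(MAX^2 / MSE), where higher values mean a closer match to the reference image. The page does not say how these benchmark values were computed, so the pixel range (`max_value=1.0`) and the function name are assumptions.

```python
import numpy as np


def psnr(reference, estimate, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two images of the same shape.

    max_value is the largest possible pixel value (assumed 1.0 here;
    use 255 for 8-bit images).
    """
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)


reference = np.zeros((3, 32, 32))
estimate = np.full((3, 32, 32), 0.1)   # uniform error of 0.1 per pixel
value = psnr(reference, estimate)       # MSE = 0.01, so about 20 dB
```

FID, the other metric in the table, compares feature statistics of generated and real image sets and is lower for better samples; it needs a pre-trained feature extractor, so it is not sketched here.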
