Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Erasing Undesirable Influence in Diffusion Models

About

Diffusion models are highly effective at generating high-quality images but pose risks, such as the unintentional generation of NSFW (not safe for work) content. Although various techniques have been proposed to mitigate unwanted influences in diffusion models while preserving overall performance, achieving a balance between these goals remains challenging. In this work, we introduce EraseDiff, an algorithm designed to preserve the utility of the diffusion model on retained data while removing the unwanted information associated with the data to be forgotten. Our approach formulates this task as a constrained optimization problem using the value function, resulting in a natural first-order algorithm for solving the optimization problem. By altering the generative process to deviate away from the ground-truth denoising trajectory, we update parameters for preservation while controlling constraint reduction to ensure effective erasure, striking an optimal trade-off. Extensive experiments and thorough comparisons with state-of-the-art algorithms demonstrate that EraseDiff effectively preserves the model's utility, efficacy, and efficiency.

Jing Wu, Trung Le, Munawar Hayat, Mehrtash Harandi• 2024

Related benchmarks

TaskDatasetResultRank
Class-wise ForgettingImageNette (val)
FID0.78
44
Concept UnlearningUnlearnDiffAtk
UnlearnDiffAtk0.1831
36
Class-wise ForgettingImagenette Stable Diffusion v1.4 (val)
FID0.78
22
Art Style UnlearningUnlearnCanvas Van Gogh style
FID54.48
18
Utility PreservationCOCO
CLIP Score0.307
14
Inappropriate Content ErasingI2P
I2P (%)0.85
14
Adversarial Robustness in Concept ErasingMMA-Diffusion
MMA-Diffusion Score4.1
14
Adversarial Robustness in Concept ErasingRing-A-Bell K-16, K-38, K-77
K-16 Score0.0211
14
Safety EvaluationRing-a-Bell
Ring-16 Score4.41
13
Object ErasingUnlearnCanvas
Unlearning Accuracy (UA)86.67
13
Showing 10 of 18 rows

Other info

Code

Follow for update