Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Score-based Membership Inference on Diffusion Models

About

Membership inference attacks (MIAs) against Diffusion Models (DMs) raise pressing privacy concerns by revealing whether a sample was part of the training set. While existing methods typically rely on measuring reconstruction error across multiple denoising steps as a test statistic, they often incur significant computational overhead. In this work, we present a simple yet successful attack statistic using only the predicted noise vectors from the DM's denoiser, or equivalently, the score. Specifically, we show that the expected denoiser output points toward a kernel-weighted local mean of nearby training samples, such that its norm encodes proximity to the training set and thereby reveals membership. Building on this observation, we propose SimA, a single-query attack that provides a principled, efficient alternative to existing multi-query methods. SimA consistently achieves superior performance across variants of DMs and the Latent Diffusion Models (LDMs) on eight different datasets. Its Monte Carlo variant (SimA-MC) exhibits state-of-the-art performance across all experiments, significantly outperforming baseline methods in terms of TPR@1%FPR. These results demonstrate that complex reconstruction trajectories are unnecessary for effective membership inference, establishing SimA as a highly efficient benchmark for auditing privacy in DMs and LDMs.

Mingxing Rao, Bowen Qu, Daniel Moyer• 2025

Related benchmarks

TaskDatasetResultRank
Membership Inference AttackCIFAR-10
AUC92.16
120
Membership Inference AttackCIFAR-100
TPR @ 1% FPR44.66
46
Membership Inference AttackCelebA
AUC95.04
22
Membership Inference AttackImageNet
AUC71.13
15
Membership Inference AttackSTL10 U
ASR80.55
13
Membership Inference AttackPokémon lambdalabs blip-captions (fine-tuned)
AUC97.01
7
Membership Inference AttackMS-COCO fine-tuned 2017 (val)
AUC94.24
7
Membership Inference AttackFlickr30K (fine-tuned)
AUC72.23
7
Membership Inference AttackImageNet 1K V2 (train)
ASR85.73
7
Showing 9 of 9 rows

Other info

Follow for update