Membership Inference Attacks against Machine Learning Models

About

We quantitatively investigate how machine learning models leak information about the individual data records on which they were trained. We focus on the basic membership inference attack: given a data record and black-box access to a model, determine if the record was in the model's training dataset. To perform membership inference against a target model, we make adversarial use of machine learning and train our own inference model to recognize differences in the target model's predictions on the inputs that it trained on versus the inputs that it did not train on. We empirically evaluate our inference techniques on classification models trained by commercial "machine learning as a service" providers such as Google and Amazon. Using realistic datasets and classification tasks, including a hospital discharge dataset whose membership is sensitive from the privacy perspective, we show that these models can be vulnerable to membership inference attacks. We then investigate the factors that influence this leakage and evaluate mitigation strategies.
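The attack described above can be illustrated with a minimal, standard-library-only sketch. Everything in it is an assumption made for illustration rather than the authors' implementation: the synthetic two-class data, the deliberately memorizing kernel classifier standing in for the target model, the single "shadow" model used to obtain outputs with known membership, and the one-parameter confidence threshold standing in for a trained inference model.

```python
import math
import random

random.seed(0)
DIM, N = 10, 100  # toy sizes, chosen only for illustration

def sample(n):
    """Draw n labeled records from two overlapping Gaussian classes (synthetic data)."""
    data = []
    for _ in range(n):
        label = random.randint(0, 1)
        point = [random.gauss(label, 1.0) for _ in range(DIM)]
        data.append((point, label))
    return data

def train(records, bandwidth=0.5, prior=0.1):
    """Return a black-box predict(x) -> [p0, p1].

    Hypothetical stand-in for a target model: a smoothed kernel classifier
    that memorizes its training records, so it overfits on purpose.
    """
    def predict(x):
        votes = [prior, prior]
        for point, label in records:
            d2 = sum((a - b) ** 2 for a, b in zip(x, point))
            votes[label] += math.exp(-d2 / (2 * bandwidth ** 2))
        total = sum(votes)
        return [v / total for v in votes]
    return predict

def confidences(model, records):
    """Query the model and keep only the top class probability per record."""
    return [max(model(x)) for x, _ in records]

# Target model: the attacker can only call target(x), never inspect target_in.
target_in, target_out = sample(N), sample(N)
target = train(target_in)

# Shadow model: trained by the attacker on their own data from the same
# distribution, so membership of every shadow record is known.
shadow_in, shadow_out = sample(N), sample(N)
shadow = train(shadow_in)

# "Inference model": a threshold halfway between the mean confidence on
# shadow members and on shadow non-members (a simplification of a classifier).
mean_in = sum(confidences(shadow, shadow_in)) / N
mean_out = sum(confidences(shadow, shadow_out)) / N
threshold = (mean_in + mean_out) / 2

def is_member(x):
    """Guess membership of x in the target's training set from one query."""
    return max(target(x)) >= threshold

correct = sum(is_member(x) for x, _ in target_in)
correct += sum(not is_member(x) for x, _ in target_out)
attack_accuracy = correct / (2 * N)
print(f"threshold={threshold:.3f}  attack accuracy={attack_accuracy:.2f}")
```

The gap between member and non-member confidence is large here by construction, because the toy model memorizes its training records. A model that generalizes better would give the threshold much less to work with, which is consistent with the abstract's point that the amount of leakage depends on properties of the model and data.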

Reza Shokri, Marco Stronati, Congzheng Song, Vitaly Shmatikov • 2016

Related benchmarks

Task	Dataset	Result
Membership Inference Attack	CIFAR-10	AUC 0.6724	48
Membership Inference	CLIP image-text (train)	Precision 79.37	36
Membership Inference Attack	CIFAR-100 balanced (evaluation set)	AUROC 76.58	36
GRN Attack	Adult Income	MSE 0.231	16
GRN Attack	CIFAR10	MSE 0.121	16
GRN Attack	Drive Diagnosis	MSE 0.144	16
Membership Inference Attack	Book-Crossing	Accuracy 53.86	15
Membership Inference Attack	MovieLens	Accuracy 60	15
Feature Inference Attack (GRN)	CIFAR100	MSE 0.013	8
Feature Inference Attack (GRN)	MNIST	MSE 0.104	8

Showing 10 of 18 rows
