Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Knowledge-Enriched Distributional Model Inversion Attacks

About

Model inversion (MI) attacks are aimed at reconstructing training data from model parameters. Such attacks have triggered increasing concerns about privacy, especially given a growing number of online model repositories. However, existing MI attacks against deep neural networks (DNNs) have large room for performance improvement. We present a novel inversion-specific GAN that can better distill knowledge useful for performing attacks on private models from public data. In particular, we train the discriminator to differentiate not only the real and fake samples but the soft-labels provided by the target model. Moreover, unlike previous work that directly searches for a single data point to represent a target class, we propose to model a private data distribution for each target class. Our experiments show that the combination of these techniques can significantly boost the success rate of the state-of-the-art MI attacks by 150%, and generalize better to a variety of datasets and models. Our code is available at https://github.com/SCccc21/Knowledge-Enriched-DMI.

Si Chen, Mostafa Kahla, Ruoxi Jia, Guo-Jun Qi• 2020

Related benchmarks

TaskDatasetResultRank
Model InversionCelebA (test)
Attack Accuracy83
36
Model Inversion AttackCelebA (private) and FFHQ (public) (test)
Attack Accuracy52.87
24
Model Inversion AttackCelebA (test)
Attack Accuracy74
10
Object ClassificationCIFAR-10 (test)
Attack Accuracy95.2
8
Model Inversion AttackCelebA (private) FFHQ (public) on IR152 (test)
Top-5 Attack Accuracy85.33
8
Model Inversion AttackCelebA private FFHQ public on face.evoLve (test)
Top-5 Attack Accuracy80.67
8
Model Inversion AttackCelebA (private) FFHQ (public) on VGG16 (test)
Top-5 Attack Accuracy74
8
Model Inversion AttackCelebA private identities
Attack Accuracy82
4
Model Inversion AttackFacescrub (private identities)
Attack Accuracy0.48
4
Model Inversion AttackPubfig83 (private identities)
Attack Accuracy0.62
4
Showing 10 of 10 rows

Other info

Follow for update