Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bilateral Dependency Optimization: Defending Against Model-inversion Attacks

About

Through using only a well-trained classifier, model-inversion (MI) attacks can recover the data used for training the classifier, leading to the privacy leakage of the training data. To defend against MI attacks, previous work utilizes a unilateral dependency optimization strategy, i.e., minimizing the dependency between inputs (i.e., features) and outputs (i.e., labels) during training the classifier. However, such a minimization process conflicts with minimizing the supervised loss that aims to maximize the dependency between inputs and outputs, causing an explicit trade-off between model robustness against MI attacks and model utility on classification tasks. In this paper, we aim to minimize the dependency between the latent representations and the inputs while maximizing the dependency between latent representations and the outputs, named a bilateral dependency optimization (BiDO) strategy. In particular, we use the dependency constraints as a universally applicable regularizer in addition to commonly used losses for deep neural networks (e.g., cross-entropy), which can be instantiated with appropriate dependency criteria according to different tasks. To verify the efficacy of our strategy, we propose two implementations of BiDO, by using two different dependency measures: BiDO with constrained covariance (BiDO-COCO) and BiDO with Hilbert-Schmidt Independence Criterion (BiDO-HSIC). Experiments show that BiDO achieves the state-of-the-art defense performance for a variety of datasets, classifiers, and MI attacks while suffering a minor classification-accuracy drop compared to the well-trained classifier with no defense, which lights up a novel road to defend against MI attacks.

Xiong Peng, Feng Liu, Jingfen Zhang, Long Lan, Junjie Ye, Tongliang Liu, Bo Han• 2022

Related benchmarks

TaskDatasetResultRank
Model Inversion DefenseCelebA
Accuracy91.33
64
PPA Model Inversion AttackFaceScrub 224x224 Dpriv = Facescrub, Dpub = FFHQ (test)
Accuracy91.8
27
Model Inversion DefenseFace.evoLVe
Accuracy88.07
25
Model Inversion DefenseCelebA 64x64 (test)
Accuracy79.85
24
Model Inversion DefenseFFHQ
Accuracy79.62
12
Model Inversion DefenseCelebA (test)
Accuracy78.97
10
Model Inversion AttackFaceScrub (private) and FFHQ (public) 64x64 resolution (test)
Attack Accuracy79.85
7
Defense against Model Inversion AttackCelebA
Accuracy79.62
5
Model Inversion Defense EvaluationCelebA first 150 classes (test)
Count of User-Selected Similar Samples221
4
Model Inversion DefenseCelebA Private Public
Accuracy80.35
3
Showing 10 of 11 rows

Other info

Follow for update