Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Deep leakage from gradients

About

With the development of artificial intelligence technology, Federated Learning (FL) model has been widely used in many industries for its high efficiency and confidentiality. Some researchers have explored its confidentiality and designed some algorithms to attack training data sets, but these algorithms all have their own limitations. Therefore, most people still believe that local machine learning gradient information is safe and reliable. In this paper, an algorithm based on gradient features is designed to attack the federated learning model in order to attract more attention to the security of federated learning systems. In federated learning system, gradient contains little information compared with the original training data set, but this project intends to restore the original training image data through gradient information. Convolutional Neural Network (CNN) has excellent performance in image processing. Therefore, the federated learning model of this project is equipped with Convolutional Neural Network structure, and the model is trained by using image data sets. The algorithm calculates the virtual gradient by generating virtual image labels. Then the virtual gradient is matched with the real gradient to restore the original image. This attack algorithm is written in Python language, uses cat and dog classification Kaggle data sets, and gradually extends from the full connection layer to the convolution layer, thus improving the universality. At present, the average squared error between the data recovered by this algorithm and the original image information is approximately 5, and the vast majority of images can be completely restored according to the gradient information given, indicating that the gradient of federated learning system is not absolutely safe and reliable.

Yaqiong Mu• 2022

Related benchmarks

TaskDatasetResultRank
Sentiment ClassificationSST-2
Accuracy94
174
Image ClassificationSTL10
Accuracy80.2
60
Adjacency Matrix ReconstructionGraph Data Instances
AUC87.3
45
Node Feature ReconstructionGraph Data Instances
MSE0.9537
45
Language ModelingEnron Dataset
Perplexity3.4
39
Text ClassificationTweet Sentiment
F1 Score69
31
Topic ClassificationYahoo Answers Topics
Accuracy61
26
Language ModelingAG-News
PPL4.76
20
Language ModelingOpen Australian Legal Corpus
Loss1.32
12
Image ReconstructionImageNet1K
PSNR10.252
10
Showing 10 of 22 rows

Other info

Follow for update