Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FedGrad: Mitigating Backdoor Attacks in Federated Learning Through Local Ultimate Gradients Inspection

About

Federated learning (FL) enables multiple clients to train a model without compromising sensitive data. The decentralized nature of FL makes it susceptible to adversarial attacks, especially backdoor insertion during training. Recently, the edge-case backdoor attack employing the tail of the data distribution has been proposed as a powerful one, raising questions about the shortfall in current defenses' robustness guarantees. Specifically, most existing defenses cannot eliminate edge-case backdoor attacks or suffer from a trade-off between backdoor-defending effectiveness and overall performance on the primary task. To tackle this challenge, we propose FedGrad, a novel backdoor-resistant defense for FL that is resistant to cutting-edge backdoor attacks, including the edge-case attack, and performs effectively under heterogeneous client data and a large number of compromised clients. FedGrad is designed as a two-layer filtering mechanism that thoroughly analyzes the ultimate layer's gradient to identify suspicious local updates and remove them from the aggregation process. We evaluate FedGrad under different attack scenarios and show that it significantly outperforms state-of-the-art defense mechanisms. Notably, FedGrad can almost 100% correctly detect the malicious participants, thus providing a significant reduction in the backdoor effect (e.g., backdoor accuracy is less than 8%) while not reducing the main accuracy on the primary task.

Thuy Dung Nguyen, Anh Duy Nguyen, Kok-Seng Wong, Huy Hieu Pham, Thanh Hung Nguyen, Phi Le Nguyen, Truong Thao Nguyen• 2023

Related benchmarks

TaskDatasetResultRank
Federated Image ClassificationCIFAR-100 IID (test)
CBA MTA63.38
9
Image ClassificationCIFAR-100 non-IID alpha=0.5
CBA MTA60.75
9
Backdoor DefenseFashion-MNIST non-IID alpha=0.5
CBA MTA82.45
9
Federated Image ClassificationCIFAR-10 iid (test)
CBA MTA89.1
9
Image ClassificationCIFAR-10 non-IID alpha=0.5
CBA MTA80.88
9
Backdoor DefenseMNIST alpha=0.5 (non-IID)
CBA MTA97.12
9
Malicious Client DetectionMNIST alpha=0.5 (non-IID)
CBA True Positive Rate (TPR)100
8
Malicious Client DetectionFashion-MNIST alpha=0.5 (non-IID)
CBA TPR100
8
Malicious Client DetectionCIFAR-10 alpha=0.5 (Non-IID)
CBA TPR100
8
Malicious Client DetectionCIFAR-100 alpha=0.5 (non-IID)
CBA TPR100
8
Showing 10 of 10 rows

Other info

Follow for update