FedGrad: Mitigating Backdoor Attacks in Federated Learning Through Local Ultimate Gradients Inspection

About

Federated learning (FL) enables multiple clients to train a model without compromising sensitive data. The decentralized nature of FL makes it susceptible to adversarial attacks, especially backdoor insertion during training. Recently, the edge-case backdoor attack employing the tail of the data distribution has been proposed as a powerful one, raising questions about the shortfall in current defenses' robustness guarantees. Specifically, most existing defenses cannot eliminate edge-case backdoor attacks or suffer from a trade-off between backdoor-defending effectiveness and overall performance on the primary task. To tackle this challenge, we propose FedGrad, a novel backdoor-resistant defense for FL that is resistant to cutting-edge backdoor attacks, including the edge-case attack, and performs effectively under heterogeneous client data and a large number of compromised clients. FedGrad is designed as a two-layer filtering mechanism that thoroughly analyzes the ultimate layer's gradient to identify suspicious local updates and remove them from the aggregation process. We evaluate FedGrad under different attack scenarios and show that it significantly outperforms state-of-the-art defense mechanisms. Notably, FedGrad can almost 100% correctly detect the malicious participants, thus providing a significant reduction in the backdoor effect (e.g., backdoor accuracy is less than 8%) while not reducing the main accuracy on the primary task.

Thuy Dung Nguyen, Anh Duy Nguyen, Kok-Seng Wong, Huy Hieu Pham, Thanh Hung Nguyen, Phi Le Nguyen, Truong Thao Nguyen• 2023

Related benchmarks

Task	Dataset	Result
Federated Image Classification	CIFAR-100 IID (test)	CBA MTA63.38	9
Image Classification	CIFAR-100 non-IID alpha=0.5	CBA MTA60.75	9
Backdoor Defense	Fashion-MNIST non-IID alpha=0.5	CBA MTA82.45	9
Federated Image Classification	CIFAR-10 iid (test)	CBA MTA89.1	9
Image Classification	CIFAR-10 non-IID alpha=0.5	CBA MTA80.88	9
Backdoor Defense	MNIST alpha=0.5 (non-IID)	CBA MTA97.12	9
Malicious Client Detection	MNIST alpha=0.5 (non-IID)	CBA True Positive Rate (TPR)100	8
Malicious Client Detection	Fashion-MNIST alpha=0.5 (non-IID)	CBA TPR100	8
Malicious Client Detection	CIFAR-10 alpha=0.5 (Non-IID)	CBA TPR100	8
Malicious Client Detection	CIFAR-100 alpha=0.5 (non-IID)	CBA TPR100	8

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord