FedGrad: Mitigating Backdoor Attacks in Federated Learning Through Local Ultimate Gradients Inspection
About
Federated learning (FL) enables multiple clients to train a model without compromising sensitive data. The decentralized nature of FL makes it susceptible to adversarial attacks, especially backdoor insertion during training. Recently, the edge-case backdoor attack employing the tail of the data distribution has been proposed as a powerful one, raising questions about the shortfall in current defenses' robustness guarantees. Specifically, most existing defenses cannot eliminate edge-case backdoor attacks or suffer from a trade-off between backdoor-defending effectiveness and overall performance on the primary task. To tackle this challenge, we propose FedGrad, a novel backdoor-resistant defense for FL that is resistant to cutting-edge backdoor attacks, including the edge-case attack, and performs effectively under heterogeneous client data and a large number of compromised clients. FedGrad is designed as a two-layer filtering mechanism that thoroughly analyzes the ultimate layer's gradient to identify suspicious local updates and remove them from the aggregation process. We evaluate FedGrad under different attack scenarios and show that it significantly outperforms state-of-the-art defense mechanisms. Notably, FedGrad can almost 100% correctly detect the malicious participants, thus providing a significant reduction in the backdoor effect (e.g., backdoor accuracy is less than 8%) while not reducing the main accuracy on the primary task.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Federated Image Classification | CIFAR-100 IID (test) | CBA MTA63.38 | 9 | |
| Image Classification | CIFAR-100 non-IID alpha=0.5 | CBA MTA60.75 | 9 | |
| Backdoor Defense | Fashion-MNIST non-IID alpha=0.5 | CBA MTA82.45 | 9 | |
| Federated Image Classification | CIFAR-10 iid (test) | CBA MTA89.1 | 9 | |
| Image Classification | CIFAR-10 non-IID alpha=0.5 | CBA MTA80.88 | 9 | |
| Backdoor Defense | MNIST alpha=0.5 (non-IID) | CBA MTA97.12 | 9 | |
| Malicious Client Detection | MNIST alpha=0.5 (non-IID) | CBA True Positive Rate (TPR)100 | 8 | |
| Malicious Client Detection | Fashion-MNIST alpha=0.5 (non-IID) | CBA TPR100 | 8 | |
| Malicious Client Detection | CIFAR-10 alpha=0.5 (Non-IID) | CBA TPR100 | 8 | |
| Malicious Client Detection | CIFAR-100 alpha=0.5 (non-IID) | CBA TPR100 | 8 |