FedCC: Robust Federated Learning against Model Poisoning Attacks
About
Federated learning is a distributed framework designed to address privacy concerns. However, it introduces new attack surfaces, which are especially prone when data is non-Independently and Identically Distributed. Existing approaches fail to effectively mitigate the malicious influence in this setting; previous approaches often tackle non-IID data and poisoning attacks separately. To address both challenges simultaneously, we present FedCC, a simple yet effective novel defense algorithm against model poisoning attacks. It leverages the Centered Kernel Alignment similarity of Penultimate Layer Representations for clustering, allowing the identification and filtration of malicious clients, even in non-IID data settings. The penultimate layer representations are meaningful since the later layers are more sensitive to local data distributions, which allows better detection of malicious clients. The sophisticated utilization of layer-wise Centered Kernel Alignment similarity allows attack mitigation while leveraging useful knowledge obtained. Our extensive experiments demonstrate the effectiveness of FedCC in mitigating both untargeted model poisoning and targeted backdoor attacks. Compared to existing outlier detection-based and first-order statistics-based methods, FedCC consistently reduces attack confidence to zero. Specifically, it significantly minimizes the average degradation of global performance by 65.5\%. We believe that this new perspective on aggregation makes it a valuable contribution to the field of FL model security and privacy. The code will be made available upon acceptance.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Intrusion Detection | Edge-IIoTset | Accuracy95 | 84 | |
| Intrusion Detection | Edge-IIoTset | Accuracy98 | 48 | |
| Intrusion Detection | CIC-IDS 2018 | Accuracy97 | 48 | |
| Untargeted Attack Detection | Edge-IIoTset non-IID | Accuracy93 | 48 | |
| Untargeted Attack Detection | CIC-IDS non-IID 2018 | Accuracy93 | 48 | |
| Targeted attack detection | CIC-IDS IID 2018 | Accuracy96 | 48 | |
| Targeted attack detection | CIC-IDS non-IID 20% Adversaries 2018 | Detection Accuracy93 | 16 | |
| Targeted attack detection | CIC-IDS 2018 (non-IID, 40% Adversaries) | Detection Performance92 | 16 | |
| Targeted attack detection | Edge-IIoTset non-IID 10% Adversaries | Detection Rate93 | 16 | |
| Targeted attack detection | Edge-IIoTset non-IID, 20% Adversaries | Detection Performance93 | 16 |