Selective Knowledge Sharing for Privacy-Preserving Federated Distillation without A Good Teacher
About
While federated learning is promising for privacy-preserving collaborative learning without revealing local data, it remains vulnerable to white-box attacks and struggles to adapt to heterogeneous clients. Federated distillation (FD), built upon knowledge distillation--an effective technique for transferring knowledge from a teacher model to student models--emerges as an alternative paradigm, which provides enhanced privacy guarantees and addresses model heterogeneity. Nevertheless, challenges arise due to variations in local data distributions and the absence of a well-trained teacher model, which leads to misleading and ambiguous knowledge sharing that significantly degrades model performance. To address these issues, this paper proposes a selective knowledge sharing mechanism for FD, termed Selective-FD. It includes client-side selectors and a server-side selector to accurately and precisely identify knowledge from local and ensemble predictions, respectively. Empirical studies, backed by theoretical insights, demonstrate that our approach enhances the generalization capabilities of the FD framework and consistently outperforms baseline methods.
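The abstract does not say how the selectors decide what to share, so the following is only a minimal sketch of the two-stage filtering idea, not the paper's method. It assumes each client already has a per-sample in-distribution flag (how that flag is produced is left open), and it assumes the server drops ensemble predictions whose entropy exceeds a threshold. The names `client_select`, `server_select`, and `max_entropy` are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def client_select(predictions, in_distribution):
    """Hypothetical client-side selector.

    Keeps a prediction only when the client flags the proxy sample as
    in-distribution; withheld predictions are replaced by None.
    """
    return [p if ok else None for p, ok in zip(predictions, in_distribution)]


def server_select(client_predictions, max_entropy):
    """Hypothetical server-side selector.

    Averages the predictions that clients actually shared for each sample and
    keeps the ensemble only if its entropy is at most max_entropy (an assumed
    ambiguity criterion). Returns {sample_index: ensemble_prediction}.
    """
    num_samples = len(client_predictions[0])
    selected = {}
    for i in range(num_samples):
        shared = [preds[i] for preds in client_predictions if preds[i] is not None]
        if not shared:
            continue  # no client was willing to share knowledge for this sample
        num_classes = len(shared[0])
        ensemble = [sum(p[c] for p in shared) / len(shared) for c in range(num_classes)]
        if entropy(ensemble) <= max_entropy:
            selected[i] = ensemble
    return selected


# Two clients, three proxy samples, two classes (made-up numbers).
shared_a = client_select([[0.9, 0.1], [0.5, 0.5], [0.2, 0.8]], [True, True, False])
shared_b = client_select([[0.8, 0.2], [0.4, 0.6], [0.1, 0.9]], [True, False, True])
selected = server_select([shared_a, shared_b], max_entropy=0.5)
print(sorted(selected))  # indices of samples whose ensemble prediction was kept
```

In this toy run, sample 1 is shared by only one client, with a uniform prediction, so the server-side filter discards it. Samples 0 and 2 survive and would be sent back to the clients as distillation targets.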
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Image Classification | CIFAR-10 IID | Accuracy | 86.06 | 185 |
| Image Classification | MNIST non-IID (test) | Accuracy | 94.68 | 139 |
| Image Classification | FashionMNIST (IID) | Accuracy | 87.16 | 17 |
| Image Classification | MNIST (i.i.d.) | Test Accuracy | 98.6 | 15 |
| Image Classification | CIFAR10 Weak Non-IID | Accuracy | 85.38 | 8 |
| Image Classification | FashionMNIST Strong Non-IID | Accuracy | 75.31 | 8 |
| Image Classification | CIFAR10 Strong Non-IID | Accuracy | 80.98 | 8 |
| Image Classification | MNIST (Weak Non-IID) | Accuracy | 96.3 | 8 |
| Image Classification | FashionMNIST (Weak Non-IID) | Accuracy | 77.27 | 8 |