Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch
About
Federated Semi-Supervised Learning (FSSL) aims to leverage unlabeled data across clients with limited labeled data to train a global model with strong generalization ability. Most FSSL methods rely on consistency regularization with pseudo-labels, converting predictions from local or global models into hard pseudo-labels as supervisory signals. However, we discover that the quality of pseudo-label is largely deteriorated by data heterogeneity, an intrinsic facet of federated learning. In this paper, we study the problem of FSSL in-depth and show that (1) heterogeneity exacerbates pseudo-label mismatches, further degrading model performance and convergence, and (2) local and global models' predictive tendencies diverge as heterogeneity increases. Motivated by these findings, we propose a simple and effective method called Semi-supervised Aggregation for Globally-Enhanced Ensemble (SAGE), that can flexibly correct pseudo-labels based on confidence discrepancies. This strategy effectively mitigates performance degradation caused by incorrect pseudo-labels and enhances consensus between local and global models. Experimental results demonstrate that SAGE outperforms existing FSSL methods in both performance and convergence. Our code is available at https://github.com/Jay-Codeman/SAGE
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | CINIC-10 (test) | Accuracy78.77 | 177 | |
| Image Classification | SVHN (test) | Accuracy94.68 | 51 | |
| Classification | CIFAR-100 10% labeled data | Accuracy56.06 | 46 | |
| Image Classification | SVHN 1.0 (10% label) | Accuracy94.65 | 42 | |
| Image Classification | CIFAR-10 10% label | Accuracy89.08 | 42 | |
| Image Classification | CIFAR-100 (test) | Accuracy0.6201 | 42 | |
| Image Classification | CINIC-10 1.0 (10% label) | Accuracy76.68 | 42 | |
| Federated Semi-supervised Learning | CIFAR100 alpha=1.0 (test) | Convergence Round56 | 21 | |
| Image Classification | CIFAR-100 alpha=0.1 (test) | Steps to 30% Accuracy60 | 7 |