Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Estimating before Debiasing: A Bayesian Approach to Detaching Prior Bias in Federated Semi-Supervised Learning

About

Federated Semi-Supervised Learning (FSSL) leverages both labeled and unlabeled data on clients to collaboratively train a model.In FSSL, the heterogeneous data can introduce prediction bias into the model, causing the model's prediction to skew towards some certain classes. Existing FSSL methods primarily tackle this issue by enhancing consistency in model parameters or outputs. However, as the models themselves are biased, merely constraining their consistency is not sufficient to alleviate prediction bias. In this paper, we explore this bias from a Bayesian perspective and demonstrate that it principally originates from label prior bias within the training data. Building upon this insight, we propose a debiasing method for FSSL named FedDB. FedDB utilizes the Average Prediction Probability of Unlabeled Data (APP-U) to approximate the biased prior.During local training, FedDB employs APP-U to refine pseudo-labeling through Bayes' theorem, thereby significantly reducing the label prior bias. Concurrently, during the model aggregation, FedDB uses APP-U from participating clients to formulate unbiased aggregate weights, thereby effectively diminishing bias in the global model. Experimental results show that FedDB can surpass existing FSSL methods. The code is available at https://github.com/GuogangZhu/FedDB.

Guogang Zhu, Xuefeng Liu, Xinghao Wu, Shaojie Tang, Chao Tang, Jianwei Niu, Hao Su• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationCINIC-10 (test)
Accuracy75.65
177
Image ClassificationSVHN (test)
Accuracy94.27
51
ClassificationCIFAR-100 10% labeled data
Accuracy51.55
46
Image ClassificationCIFAR-10 10% label
Accuracy87.49
42
Image ClassificationSVHN 1.0 (10% label)
Accuracy93.14
42
Image ClassificationCINIC-10 1.0 (10% label)
Accuracy73.61
42
Image ClassificationCIFAR-100 (test)
Accuracy0.5548
42
Federated Semi-supervised LearningCIFAR100 alpha=1.0 (test)
Convergence Round103
21
Image ClassificationCIFAR-100 alpha=0.1 (test)
Steps to 30% Accuracy103
7
Showing 9 of 9 rows

Other info

Follow for update