ProxyFL: A Proxy-Guided Framework for Federated Semi-Supervised Learning

About

Federated Semi-Supervised Learning (FSSL) aims to collaboratively train a global model across clients by leveraging partially annotated local data in a privacy-preserving manner. In FSSL, data heterogeneity is a challenging issue that exists both across clients and within clients. External heterogeneity refers to the discrepancy in data distribution across different clients, while internal heterogeneity refers to the mismatch between labeled and unlabeled data within a client. Most FSSL methods design fixed or dynamic parameter aggregation strategies to collect client knowledge on the server (external) and/or filter out low-confidence unlabeled samples to reduce mistakes on the local client (internal). However, the former struggles to precisely fit the ideal global distribution via direct weights, and the latter reduces the amount of data that participates in FL training. To this end, we propose a proxy-guided framework called ProxyFL that simultaneously mitigates external and internal heterogeneity via a unified proxy. Specifically, we treat the learnable weights of the classifier as proxies that simulate the category distribution both locally and globally. For external heterogeneity, we explicitly optimize the global proxy against outliers instead of using direct weights; for internal heterogeneity, we re-include the discarded samples in training through a positive-negative proxy pool to mitigate the impact of potentially incorrect pseudo-labels. Insightful experiments and theoretical analysis demonstrate the strong performance and convergence of ProxyFL in FSSL.
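
To make the proxy idea in the abstract more concrete, the following is a minimal sketch and not the authors' implementation. It assumes each row of a linear classifier's weight matrix serves as one class proxy, that pseudo-label confidence is the maximum softmax probability, and that a "positive-negative proxy pool" can be approximated by taking the top-k most likely classes as positives and the rest as negatives. The function names, the 0.95 threshold, and the top-k rule are all hypothetical choices made for illustration; the abstract does not specify them.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def proxy_logits(feature, proxies):
    """Score a feature against each class proxy.

    Assumption: each proxy is one row of the classifier weight matrix,
    and the score is a plain dot product (no bias term).
    """
    return [sum(f * w for f, w in zip(feature, proxy)) for proxy in proxies]


def split_by_confidence(features, proxies, threshold=0.95):
    """Split unlabeled samples into confident and discarded sets.

    Each entry is (sample index, pseudo-label, confidence). The threshold
    value is a hypothetical default, not taken from the paper.
    """
    confident, discarded = [], []
    for idx, feat in enumerate(features):
        probs = softmax(proxy_logits(feat, proxies))
        conf = max(probs)
        label = probs.index(conf)
        target = confident if conf >= threshold else discarded
        target.append((idx, label, conf))
    return confident, discarded


def positive_negative_pool(probs, top_k=2):
    """Hypothetical pool for a low-confidence sample.

    The top_k most likely classes are kept as candidate positives and the
    remaining classes as negatives, so the sample can still contribute to
    training without committing to a single pseudo-label.
    """
    ranked = sorted(range(len(probs)), key=lambda c: probs[c], reverse=True)
    return ranked[:top_k], ranked[top_k:]
```

In this sketch, a sample that falls below the threshold is not dropped: it is handed to `positive_negative_pool`, which returns class indices that a loss could pull the sample toward (positives) or push it away from (negatives). How ProxyFL actually builds and uses its pool may differ.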

Duowen Chen, Yan Wang • 2026

Related benchmarks

Task	Dataset	Result
Image Classification	CINIC-10 (test)	Accuracy 81.46	177
Image Classification	SVHN (test)	Accuracy 95.4	51
Classification	CIFAR-100 10% labeled data	Accuracy 58.75	46
Image Classification	SVHN 1.0 (10% label)	Accuracy 95.26	42
Image Classification	CINIC-10 1.0 (10% label)	Accuracy 79.59	42
Image Classification	CIFAR-100 (test)	Accuracy 0.6319	42
Image Classification	CIFAR-10 10% label	Accuracy 90	42
Federated Semi-supervised Learning	CIFAR100 alpha=1.0 (test)	Convergence Round 45	21
Image Classification	CIFAR-100 alpha=0.1 (test)	Steps to 30% Accuracy 45	7
