Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Coresets for Vertical Federated Learning: Regularized Linear Regression and $K$-Means Clustering

About

Vertical federated learning (VFL), where data features are stored in multiple parties distributively, is an important area in machine learning. However, the communication complexity for VFL is typically very high. In this paper, we propose a unified framework by constructing coresets in a distributed fashion for communication-efficient VFL. We study two important learning tasks in the VFL setting: regularized linear regression and $k$-means clustering, and apply our coreset framework to both problems. We theoretically show that using coresets can drastically alleviate the communication complexity, while nearly maintain the solution quality. Numerical experiments are conducted to corroborate our theoretical findings.

Lingxiao Huang, Zhize Li, Jialin Sun, Haoyu Zhao• 2022

Related benchmarks

TaskDatasetResultRank
k-means clusteringYearPredictionMSD (train)
Avg Cost71.92
26
Regularized Linear RegressionYearPredictionMSD (train)
Average Cost91.97
25
Clusteringchowdary 2006
COLLAB Silhouette0.769
7
ClusteringCOIL-20 (test)
COLLAB. SIL0.207
7
Clusteringalizadeh 2000 v2
COLLAB SIL Score0.222
7
ClusteringCOIL-20
Collaborative ARI56.9
7
Clusteringalizadeh 2000 v2
COLLAB ARI0.812
7
Clusteringsynthetic multimodal dataset
ARI (AGENT0)9.11e-4
6
Showing 8 of 8 rows

Other info

Code

Follow for update