Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification

About

Federated Learning enables visual models to be trained in a privacy-preserving way using real-world data from mobile devices. Given their distributed nature, the statistics of the data across these devices is likely to differ significantly. In this work, we look at the effect such non-identical data distributions has on visual classification via Federated Learning. We propose a way to synthesize datasets with a continuous range of identicalness and provide performance measures for the Federated Averaging algorithm. We show that performance degrades as distributions differ more, and propose a mitigation strategy via server momentum. Experiments on CIFAR-10 demonstrate improved classification performance over a range of non-identicalness, with classification accuracy improved from 30.1% to 76.9% in the most skewed settings.

Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10 (test)
Accuracy85.44
3381
Image ClassificationCIFAR100 (test)
Accuracy53.69
112
Image ClassificationCIFAR10 centralized performance (test)
Accuracy84
104
Inference AttackFederated Learning environments Unauthorized Access (test)
Inference Attack Accuracy7.69
66
Inference AttackFederated Learning environments Authorized Access (test)
Inference Attack Accuracy28.57
66
Image ClassificationMNIST NN (test)
Communication Rounds19
62
Image ClassificationCIFAR-100 VGG-11 (test)
Communication Rounds73
61
Image ClassificationTiny-Imagenet Resnet20 (test)
Communication Rounds286
48
Image ClassificationCIFAR-10 LeNet-5 (test)
Communication Rounds79
44
MRI prostate segmentationProstate MRI (test)
Client 1 Score89.08
34
Showing 10 of 33 rows

Other info

Follow for update