Adaptive Federated Optimization
About
Federated learning is a distributed machine learning paradigm in which a large number of clients coordinate with a central server to learn a model without sharing their own training data. Standard federated optimization methods such as Federated Averaging (FedAvg) are often difficult to tune and exhibit unfavorable convergence behavior. In non-federated settings, adaptive optimization methods have had notable success in combating such issues. In this work, we propose federated versions of adaptive optimizers, including Adagrad, Adam, and Yogi, and analyze their convergence in the presence of heterogeneous data for general non-convex settings. Our results highlight the interplay between client heterogeneity and communication efficiency. We also perform extensive experiments on these methods and show that the use of adaptive optimizers can significantly improve the performance of federated learning.
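The key idea is to keep client computation as plain local SGD while the server treats the averaged client delta as a pseudo-gradient and applies an adaptive update to it. Below is a minimal sketch of a FedAdam-style server update under that reading; the toy quadratic objective, the `client_update` helper, and all hyperparameter values are illustrative assumptions, not the paper's reference implementation.

```python
# Sketch of server-side adaptive federated optimization (FedAdam-style).
# Assumption: full client participation and simple mean aggregation of deltas.
import numpy as np

def client_update(global_weights, data, lr=0.01, local_steps=5):
    """Run a few steps of local SGD on a toy least-squares objective."""
    w = global_weights.copy()
    X, y = data
    for _ in range(local_steps):
        grad = X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w - global_weights  # model delta sent back to the server

def fed_adam(clients, dim, rounds=50, server_lr=0.1,
             beta1=0.9, beta2=0.99, tau=1e-3):
    """Server keeps Adam-style moment estimates of the averaged client delta."""
    w = np.zeros(dim)
    m = np.zeros(dim)
    v = np.zeros(dim)
    for _ in range(rounds):
        # In practice only a sampled subset of clients participates each round.
        deltas = [client_update(w, data) for data in clients]
        delta = np.mean(deltas, axis=0)
        m = beta1 * m + (1 - beta1) * delta
        v = beta2 * v + (1 - beta2) * delta ** 2  # Adagrad/Yogi variants change only this line
        w = w + server_lr * m / (np.sqrt(v) + tau)
    return w

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    true_w = rng.normal(size=3)
    # Heterogeneous clients: each draws features from a shifted distribution.
    clients = []
    for shift in (-1.0, 0.0, 2.0):
        X = rng.normal(loc=shift, size=(100, 3))
        y = X @ true_w + 0.1 * rng.normal(size=100)
        clients.append((X, y))
    print("recovered weights:", np.round(fed_adam(clients, dim=3), 3))
```

Swapping the second-moment line for an Adagrad- or Yogi-style accumulator yields the other adaptive variants while leaving the client-side computation unchanged.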
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Classification | CIFAR10 (test) | Accuracy | 70 | 585 |
| Image Classification | Tiny ImageNet (test) | Accuracy | 44.81 | 265 |
| Skin Lesion Classification | HAM10000 (test) | Accuracy | 83.22 | 83 |
| Inference Attack | Federated Learning environments Unauthorized Access (test) | Inference Attack Accuracy | 8.6 | 66 |
| Inference Attack | Federated Learning environments Authorized Access (test) | Inference Attack Accuracy | 30.12 | 66 |
| Regression | PovertyMap (test) | Worst-U/R Pearson Correlation | 0.7294 | 43 |
| Federated Learning | ACSIncome | Local Average Distance (AD) | 0.23 | 30 |
| Wildlife Species Classification | WILDS-iWildCam ID (test) | Macro F1 | 35.7 | 23 |
| Image Classification | Cifar10 Dirichlet(0.3) (test) | -- | -- | 21 |
| Image Classification | CIFAR-100 Dirichlet 0.6, 500 clients, 2% participation (test) | Accuracy (500R) | 37.57 | 13 |