FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning

About

Recently, foundation models have exhibited remarkable advancements in multi-modal learning. These models, equipped with millions (or billions) of parameters, typically require a substantial amount of data for finetuning. However, collecting and centralizing training data from diverse sectors becomes challenging due to distinct privacy regulations. Federated Learning (FL) emerges as a promising solution, enabling multiple clients to collaboratively train neural networks without centralizing their local data. To alleviate client computation burdens and communication overheads, previous works have adapted Parameter-efficient Finetuning (PEFT) methods for FL. Hereby, only a small fraction of the model parameters are optimized and communicated during federated communications. Nevertheless, most previous works have focused on a single modality and neglected one common phenomenon, i.e., the presence of data heterogeneity across the clients. Therefore, in this work, we propose a finetuning framework tailored to heterogeneous multi-modal FL, called Federated Dual-Aadapter Teacher (FedDAT). Specifically, our approach leverages a Dual-Adapter Teacher (DAT) to address data heterogeneity by regularizing the client local updates and applying Mutual Knowledge Distillation (MKD) for an efficient knowledge transfer. FedDAT is the first approach that enables an efficient distributed finetuning of foundation models for a variety of heterogeneous Vision-Language tasks. To demonstrate its effectiveness, we conduct extensive experiments on four multi-modality FL benchmarks with different types of data heterogeneity, where FedDAT substantially outperforms the existing centralized PEFT methods adapted for FL.

Haokun Chen, Yao Zhang, Denis Krompass, Jindong Gu, Volker Tresp• 2023

Related benchmarks

Task	Dataset	Result
Text Classification	AG News (test)	Accuracy92.8	326
Personalized Federated Learning	DRAKE dynamic (Others)	Alast50.08	40
Personalized Federated Learning	DRAKE dynamic (Self)	Alast66.08	40
Commonsense Generation	CommonGen (test)	METEOR16.42	39
Medical Visual Question Answering	Federated Medical VQA Mixed-Modality Task 3 VQA-Med 2019-2021 SLAKE VQA-RAD (test)	SLAKE Score72.79	34
Personalized Federated Learning	DRAKE (Self)	Alast61.28	30
Image Generation	Caltech-256	MSE0.3715	20
Personalized Federated Learning	DRAKE static (Others)	Alast49.76	20
Visual Question Answering	VQA (test)	BS Score54.96	20
Text Generation	MMLU (test)	BS Score46.16	20

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord