Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FedDAT: An Approach for Foundation Model Finetuning in Multi-Modal Heterogeneous Federated Learning

About

Recently, foundation models have exhibited remarkable advancements in multi-modal learning. These models, equipped with millions (or billions) of parameters, typically require a substantial amount of data for finetuning. However, collecting and centralizing training data from diverse sectors becomes challenging due to distinct privacy regulations. Federated Learning (FL) emerges as a promising solution, enabling multiple clients to collaboratively train neural networks without centralizing their local data. To alleviate client computation burdens and communication overheads, previous works have adapted Parameter-efficient Finetuning (PEFT) methods for FL. Hereby, only a small fraction of the model parameters are optimized and communicated during federated communications. Nevertheless, most previous works have focused on a single modality and neglected one common phenomenon, i.e., the presence of data heterogeneity across the clients. Therefore, in this work, we propose a finetuning framework tailored to heterogeneous multi-modal FL, called Federated Dual-Aadapter Teacher (FedDAT). Specifically, our approach leverages a Dual-Adapter Teacher (DAT) to address data heterogeneity by regularizing the client local updates and applying Mutual Knowledge Distillation (MKD) for an efficient knowledge transfer. FedDAT is the first approach that enables an efficient distributed finetuning of foundation models for a variety of heterogeneous Vision-Language tasks. To demonstrate its effectiveness, we conduct extensive experiments on four multi-modality FL benchmarks with different types of data heterogeneity, where FedDAT substantially outperforms the existing centralized PEFT methods adapted for FL.

Haokun Chen, Yao Zhang, Denis Krompass, Jindong Gu, Volker Tresp• 2023

Related benchmarks

TaskDatasetResultRank
Text ClassificationAG News (test)
Accuracy92.8
293
Personalized Federated LearningDRAKE dynamic (Others)
Alast50.08
40
Personalized Federated LearningDRAKE dynamic (Self)
Alast66.08
40
Commonsense GenerationCommonGen (test)
METEOR16.42
39
Medical Visual Question AnsweringFederated Medical VQA Mixed-Modality Task 3 VQA-Med 2019-2021 SLAKE VQA-RAD (test)
SLAKE Score72.79
34
Personalized Federated LearningDRAKE (Self)
Alast61.28
30
Image GenerationCaltech-256
MSE0.3715
20
Personalized Federated LearningDRAKE static (Others)
Alast49.76
20
Visual Question AnsweringVQA (test)
BS Score54.96
20
Text GenerationMMLU (test)
BS Score46.16
20
Showing 10 of 26 rows

Other info

Follow for update