Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation

About

Multimodal Federated Learning (MFL) enables clients with heterogeneous data modalities to collaboratively train models without sharing raw data, offering a privacy-preserving framework that leverages complementary cross-modal information. However, existing methods often overlook personalized client performance and struggle with modality/task discrepancies, as well as model heterogeneity. To address these challenges, we propose FedAFD, a unified MFL framework that enhances client and server learning. On the client side, we introduce a bi-level adversarial alignment strategy to align local and global representations within and across modalities, mitigating modality and task gaps. We further design a granularity-aware fusion module to integrate global knowledge into the personalized features adaptively. On the server side, to handle model heterogeneity, we propose a similarity-guided ensemble distillation mechanism that aggregates client representations on shared public data based on feature similarity and distills the fused knowledge into the global model. Extensive experiments conducted under both IID and non-IID settings demonstrate that FedAFD achieves superior performance and efficiency for both the client and the server.

Min Tan, Junchao Ma, Yinfu Feng, Jiajun Ding, Wenwen Pan, Tingting Han, Qian Zheng, Zhenzhong Kuang, Zhou Yu• 2026

Related benchmarks

TaskDatasetResultRank
Text-to-Image RetrievalFlickr30k (test)
Recall@129.83
445
Image ClassificationCIFAR-100--
435
Image-to-Text RetrievalFlickr30k (test)
R@136.55
392
Text ClassificationAG News (test)--
228
Image ClassificationCIFAR-100 (test)
Acc33.18
110
Text ClassificationAGNews
Accuracy89.34
61
Text-to-Image RetrievalMS COCO 1K
R@126.02
51
Cross-modal retrievalFlickr30k (test)
Image-to-text Recall@132.48
25
Cross-modal retrievalMS-COCO (test)
R@1 (I2T)33.98
8
Cross-modal retrievalMS-COCO 1K image folds (test)
RSum@159.8
8
Showing 10 of 11 rows

Other info

Follow for update