Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality Generation

About

Large-scale pretrained models have proven immensely valuable in handling data-intensive modalities like text and image. However, fine-tuning these models for certain specialized modalities, such as protein sequence and cosmic ray, poses challenges due to the significant modality discrepancy and scarcity of labeled data. In this paper, we propose an end-to-end method, PaRe, to enhance cross-modal fine-tuning, aiming to transfer a large-scale pretrained model to various target modalities. PaRe employs a gating mechanism to select key patches from both source and target data. Through a modality-agnostic Patch Replacement scheme, these patches are preserved and combined to construct data-rich intermediate modalities ranging from easy to hard. By gradually intermediate modality generation, we can not only effectively bridge the modality gap to enhance stability and transferability of cross-modal fine-tuning, but also address the challenge of limited data in the target modality by leveraging enriched intermediate modality data. Compared with hand-designed, general-purpose, task-specific, and state-of-the-art cross-modal fine-tuning approaches, PaRe demonstrates superior performance across three challenging benchmarks, encompassing more than ten modalities.

Lincan Cai, Shuang Li, Wenxuan Ma, Jingxuan Kang, Binhui Xie, Zixun Sun, Chengwei Zhu• 2024

Related benchmarks

TaskDatasetResultRank
PDE solvingPDEBench Diff.Reac 1D (test)
nRMSE0.0029
13
PDE solvingPDEBench Advection (test)
nRMSE0.0032
9
Cross-modal adaptationNAS-Bench-360
Darcy (Relative L2)0.0074
9
PDE solvingPDEBench Diff.Sorp (test)
nRMSE0.0019
9
Diverse Prediction TasksNAS-Bench-360 (test)
Darcy Score0.0074
9
PDE solvingPDEBench Darcy (test)
nRMSE0.081
8
PDE solvingPDEBench Advection (1D)
nRMSE0.0032
5
BurgersPDEBench
nRMSE0.0114
5
PDE solvingPDEBench Burgers 1D
nRMSE0.0114
5
Aggregate Performance RankingPDEBench Multiple Tasks
Avg Rank3
5
Showing 10 of 19 rows

Other info

Follow for update