Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning Gradient-based Mixup with Extrapolation toward Flatter Minima for Domain Generalization

About

To address distribution shifts between training and test data, domain generalization (DG) leverages multiple source domains to learn a model that generalizes well to unseen domains. However, existing DG methods often overfit to the source domains, partly due to the limited coverage of the expected region in feature space. Motivated by this, we propose performing mixup with data interpolation and extrapolation to cover potentially unseen regions. To prevent the detrimental effects of unconstrained extrapolation, we carefully design a policy to generate the instance weights, named Flatness-aware Gradient-based Mixup (FGMix). The policy relies on gradient-based compatibilities to assign greater weights to instances that carry more invariant information and learn the mixup policy towards flatter minima for better generalization. On the DomainBed benchmark, we validate the efficacy of various designs of FGMix and demonstrate its superiority over other DG algorithms.

Danni Peng, Sinno Jialin Pan• 2022

Related benchmarks

TaskDatasetResultRank
Domain GeneralizationVLCS
Accuracy78.7
270
Domain GeneralizationPACS
Accuracy88.4
263
Domain GeneralizationOfficeHome
Accuracy71.3
234
Domain GeneralizationDomainNet
Accuracy45.1
153
Domain GeneralizationTerraIncognita
Accuracy52.3
121
Showing 5 of 5 rows

Other info

Follow for update