Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Domain Generalization by Mutual-Information Regularization with Pre-trained Models

About

Domain generalization (DG) aims to learn a generalized model to an unseen target domain using only limited source domains. Previous attempts to DG fail to learn domain-invariant representations only from the source domains due to the significant domain shifts between training and test domains. Instead, we re-formulate the DG objective using mutual information with the oracle model, a model generalized to any possible domain. We derive a tractable variational lower bound via approximating the oracle model by a pre-trained model, called Mutual Information Regularization with Oracle (MIRO). Our extensive experiments show that MIRO significantly improves the out-of-distribution performance. Furthermore, our scaling experiments show that the larger the scale of the pre-trained model, the greater the performance improvement of MIRO. Source code is available at https://github.com/kakaobrain/miro.

Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun• 2022

Related benchmarks

TaskDatasetResultRank
Domain GeneralizationVLCS
Accuracy79.1
270
Domain GeneralizationPACS
Accuracy85.4
263
Image ClassificationDomainNet
Accuracy (ClipArt)74.9
238
Domain GeneralizationOfficeHome
Accuracy70.7
234
Image ClassificationOfficeHome
Average Accuracy70.5
161
Domain GeneralizationDomainNet
Accuracy44.3
153
Image ClassificationPACS
Accuracy85.4
130
Domain GeneralizationDomainBed
Average Accuracy77.3
127
Domain GeneralizationTerraIncognita
Accuracy50.4
121
Domain GeneralizationDomainBed (test)
VLCS Accuracy79.9
118
Showing 10 of 47 rows

Other info

Code

Follow for update