Domain Generalization by Mutual-Information Regularization with Pre-trained Models

About

Domain generalization (DG) aims to learn a generalized model to an unseen target domain using only limited source domains. Previous attempts to DG fail to learn domain-invariant representations only from the source domains due to the significant domain shifts between training and test domains. Instead, we re-formulate the DG objective using mutual information with the oracle model, a model generalized to any possible domain. We derive a tractable variational lower bound via approximating the oracle model by a pre-trained model, called Mutual Information Regularization with Oracle (MIRO). Our extensive experiments show that MIRO significantly improves the out-of-distribution performance. Furthermore, our scaling experiments show that the larger the scale of the pre-trained model, the greater the performance improvement of MIRO. Source code is available at https://github.com/kakaobrain/miro.

Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun• 2022

Related benchmarks

Task	Dataset	Result
Domain Generalization	VLCS	Accuracy79.1	270
Domain Generalization	PACS	Accuracy85.4	263
Image Classification	DomainNet	Accuracy (ClipArt)74.9	238
Domain Generalization	OfficeHome	Accuracy70.7	234
Image Classification	OfficeHome	Average Accuracy70.5	161
Domain Generalization	DomainNet	Accuracy44.3	153
Image Classification	PACS	Accuracy85.4	130
Domain Generalization	DomainBed	Average Accuracy77.3	127
Domain Generalization	TerraIncognita	Accuracy50.4	121
Domain Generalization	DomainBed (test)	VLCS Accuracy79.9	118

Showing 10 of 47 rows

Other info

Code

Follow for update

@wizwand_team Discord