Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models

About

Machine learning models have made incredible progress, but they still struggle when applied to examples from unseen domains. This study focuses on a specific problem of domain generalization, where a model is trained on one source domain and tested on multiple target domains that are unseen during training. We propose IMO: Invariant features Masks for Out-of-Distribution text classification, to achieve OOD generalization by learning invariant features. During training, IMO would learn sparse mask layers to remove irrelevant features for prediction, where the remaining features keep invariant. Additionally, IMO has an attention module at the token level to focus on tokens that are useful for prediction. Our comprehensive experiments show that IMO substantially outperforms strong baselines in terms of various evaluation metrics and settings.

Tao Feng, Lizhen Qu, Zhuang Li, Haolan Zhan, Yuncheng Hua, Gholamreza Haffari• 2024

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisIMDB
Accuracy93.97
57
Multi-class Topic ClassificationAG-News
Title to Desc Alignment Score89.4
14
Sentiment AnalysisIMDB, Amazon, Yelp, and TweetEval Overall Average
Average Accuracy91.81
14
Social factor predictionSocialDial Synthetic to Human
Location Macro F123.22
10
Showing 4 of 4 rows

Other info

Code

Follow for update