Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

eXplaining to Learn (eX2L): Regularization Using Contrastive Visual Explanation Pairs for Distribution Shifts

About

Despite extensive research into mitigating distribution shifts, many existing algorithms yield inconsistent performance, often failing to outperform baseline Empirical Risk Minimization (ERM) across diverse scenarios. Furthermore, high algorithmic complexity frequently limits interpretability and offers only an indirect means of addressing spurious correlations. We propose eXplaining to Learn (eX2L): an interpretable, explanation-based framework that decorrelates confounding features from a classifier's latent representations during training. eX2L achieves this by penalizing the similarity between Grad-CAM activation maps generated by a primary label classifier and those from a concurrently trained confounder classifier. On the rigorous Spawrious Many-to-Many Hard Challenge benchmark, eX2L achieves an average accuracy (AA) of 82.24% +/- 3.87% and a worst-group accuracy (WGA) of 66.31% +/- 8.73%, outperforming the current state-of-the-art (SOTA) by 5.49% and 10.90%, respectively. Beyond its competitive performance, eX2L demonstrates that functional domain invariance can be achieved by explicitly decoupling label and nuisance attributes at the group level.

Paulo Mario P. Medina, Jose Marie Antonio Mi\~noza, Sebastian C. Iba\~nez• 2026

Related benchmarks

TaskDatasetResultRank
ClassificationCelebA
Avg Accuracy91.7
197
Domain GeneralizationSpawrious M2M-Hard
Average Accuracy (AA)82.24
12
Subpopulation ShiftCMNIST
AA69.66
12
Subpopulation ShiftWaterbirds
Average Accuracy (AA)92.12
12
Domain GeneralizationSpawrious O2O-Easy
AA94.3
12
Showing 5 of 5 rows

Other info

Follow for update