Superclass-Guided Representation Disentanglement for Spurious Correlation Mitigation

About

To enhance group robustness to spurious correlations, prior work often relies on auxiliary group annotations and assumes identical sets of groups across training and test domains. To overcome these limitations, we propose to leverage superclasses -- categories that lie higher in the semantic hierarchy than the task's actual labels -- as a more intrinsic signal than group labels for discerning spurious correlations. Our model incorporates superclass guidance from a pretrained vision-language model via gradient-based attention alignment, and then integrates feature disentanglement with a theoretically supported minimax-optimal feature-usage strategy. As a result, our approach attains robustness to more complex group structures and spurious correlations, without the need to annotate any training samples. Experiments across diverse domain generalization tasks show that our method significantly outperforms strong baselines and goes well beyond the vision-language model's guidance, with clear improvements in both quantitative metrics and qualitative visualizations.

Chenruo Liu, Hongjun Liu, Zeyu Lai, Yiqiu Shen, Chen Zhao, Qi Lei • 2025

Related benchmarks

Task | Dataset | Result | Rank
Image Classification | MetaShift (test) | Average Accuracy: 79.5 | 27
Image Classification | Waterbirds 95% | Accuracy Variance (Group): 6 | 22
Image Classification | Waterbirds 100% | Accuracy Variance across Groups: 16 | 22
Image Classification | Spawrious | O2O Easy Accuracy: 90.9 | 22
Classification | MetaShift | Average Worst Group Accuracy: 76.8 | 20
Image Classification | BFFHQ bias-conflicting (test) | -- | 17
Image Classification | Spawrious (test) | O2O Accuracy (Easy): 82.7 | 15
Group Robustness | Waterbirds 100% | Worst Group Accuracy: 79.7 | 11
Image Classification | Waterbirds 100% correlation (test) | Worst-group Accuracy: 79.7 | 11
Group Robustness | Waterbirds 95% | Worst Group Accuracy: 84.4 | 11
Showing 10 of 13 rows
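
The group-level metrics named in the table (average accuracy, worst-group accuracy, accuracy variance across groups) can be illustrated with a small sketch. This is not the paper's or the benchmark's evaluation code: the function names, the toy labels, and the group identifiers below are hypothetical, and the use of population variance over per-group accuracies is an assumption, since the page does not define how the variance is computed or scaled.

```python
from collections import defaultdict
from statistics import pvariance


def group_accuracies(y_true, y_pred, groups):
    """Return a dict mapping each group id to its accuracy in [0, 1]."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p, g in zip(y_true, y_pred, groups):
        total[g] += 1
        correct[g] += int(t == p)
    return {g: correct[g] / total[g] for g in total}


def summarize(y_true, y_pred, groups):
    """Compute overall accuracy plus two group-robustness summaries."""
    per_group = group_accuracies(y_true, y_pred, groups)
    values = list(per_group.values())
    n_correct = sum(int(t == p) for t, p in zip(y_true, y_pred))
    return {
        "average_accuracy": n_correct / len(y_true),
        # Accuracy of the group on which the model does worst.
        "worst_group_accuracy": min(values),
        # Assumption: population variance of the per-group accuracies.
        "group_accuracy_variance": pvariance(values),
    }


# Toy example with four hypothetical groups "a" to "d".
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 1]
groups = ["a", "a", "b", "b", "c", "c", "d", "d"]

stats = summarize(y_true, y_pred, groups)
print(stats)
```

In this toy case the overall accuracy is 0.625 while the worst-group accuracy is 0.0, which shows why a model can look reasonable on average and still fail entirely on one group. In datasets such as Waterbirds, a group is typically a combination of the class label and the spurious attribute (the background).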
