Superclass-Guided Representation Disentanglement for Spurious Correlation Mitigation
About
To enhance group robustness to spurious correlations, prior work often relies on auxiliary group annotations and assumes identical sets of groups across training and test domains. To overcome these limitations, we propose to leverage superclasses -- categories that lie higher in the semantic hierarchy than the task's actual labels -- as a more intrinsic signal than group labels for discerning spurious correlations. Our model incorporates superclass guidance from a pretrained vision-language model via gradient-based attention alignment, and then integrates feature disentanglement with a theoretically supported minimax-optimal feature-usage strategy. As a result, our approach attains robustness to more complex group structures and spurious correlations, without the need to annotate any training samples. Experiments across diverse domain generalization tasks show that our method significantly outperforms strong baselines and goes well beyond the vision-language model's guidance, with clear improvements in both quantitative metrics and qualitative visualizations.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | MetaShift (test) | Average Accuracy79.5 | 27 | |
| Image Classification | Waterbirds 95% | Accuracy Variance (Group)6 | 22 | |
| Image Classification | Waterbirds 100% | Accuracy Variance across Groups16 | 22 | |
| Image Classification | Spawrious | O2O Easy Accuracy90.9 | 22 | |
| Classification | MetaShift | Average Worst Group Accuracy76.8 | 20 | |
| Image Classification | BFFHQ bias-conflicting (test) | -- | 17 | |
| Image Classification | Spawrious (test) | O2O Accuracy (Easy)82.7 | 15 | |
| Group Robustness | Waterbirds 100% | Worst Group Accuracy79.7 | 11 | |
| Image Classification | Waterbirds 100% correlation (test) | Worst-group Accuracy79.7 | 11 | |
| Group Robustness | Waterbirds 95% | Worst Group Accuracy84.4 | 11 |