| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Image Classification | MetaShift | Average Accuracy | 93 | 33 |
| Image Classification | MetaShift (test) | Average Accuracy | 92.1 | 27 |
| Classification | MetaShift | Average Worst Group Accuracy | 89.3 | 20 |
| Multi-class debiasing | MetaShift 10-class | Worst-group Accuracy (p=12%) | 70.08 | 3 |
| Multi-class debiasing | MetaShift 2-class | Worst-group Accuracy (p=12%) | 77.78 | 3 |