FAIR-Ensemble: When Fairness Naturally Emerges From Deep Ensembling

About

Ensembling multiple Deep Neural Networks (DNNs) is a simple and effective way to improve top-line metrics and to outperform a larger single model. In this work, we go beyond top-line metrics and instead explore the impact of ensembling on subgroup performance. Surprisingly, we observe that even with a simple homogeneous ensemble (all the individual DNNs share the same training set, architecture, and design choices), minority group performance improves disproportionately with the number of models compared to the majority group, i.e., fairness naturally emerges from ensembling. Even more surprisingly, this gain keeps occurring even when a large number of models is considered, e.g., $20$, despite the fact that the average performance of the ensemble plateaus with fewer models. Our work establishes that simple DNN ensembles can be a powerful tool for alleviating disparate impact from DNN classifiers, thus curbing algorithmic harm. We also explore why this is the case. We find that even in homogeneous ensembles, varying the sources of stochasticity (parameter initialization, mini-batch sampling, and data-augmentation realizations) results in different fairness outcomes.
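To make the setup concrete, here is a minimal sketch of how a homogeneous ensemble's predictions could be combined and then scored per subgroup. It assumes the members are combined by averaging their class probabilities, which is one common choice and is not confirmed by the abstract. The function names `ensemble_predict` and `accuracy_by_group` are hypothetical, and the toy numbers are made up for illustration; they are not results from the paper.

```python
from collections import defaultdict


def ensemble_predict(member_probs):
    """Average class probabilities across members, then take the argmax.

    member_probs[m][i][c] is the probability that member m assigns
    class c to example i. Averaging is an assumed combination rule.
    """
    n_members = len(member_probs)
    n_examples = len(member_probs[0])
    n_classes = len(member_probs[0][0])
    preds = []
    for i in range(n_examples):
        mean = [
            sum(member_probs[m][i][c] for m in range(n_members)) / n_members
            for c in range(n_classes)
        ]
        preds.append(max(range(n_classes), key=mean.__getitem__))
    return preds


def accuracy_by_group(preds, labels, groups):
    """Return {group: accuracy} so subgroups can be compared directly."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for p, y, g in zip(preds, labels, groups):
        total[g] += 1
        correct[g] += int(p == y)
    return {g: correct[g] / total[g] for g in total}


# Made-up toy data: 3 members, 4 examples, 2 classes.
member_probs = [
    [[0.9, 0.1], [0.4, 0.6], [0.6, 0.4], [0.3, 0.7]],
    [[0.8, 0.2], [0.3, 0.7], [0.4, 0.6], [0.6, 0.4]],
    [[0.7, 0.3], [0.6, 0.4], [0.3, 0.7], [0.2, 0.8]],
]
labels = [0, 1, 1, 1]
groups = ["majority", "majority", "minority", "minority"]

single = ensemble_predict(member_probs[:1])   # first member alone
ensemble = ensemble_predict(member_probs)     # all three members

print("single  :", accuracy_by_group(single, labels, groups))
print("ensemble:", accuracy_by_group(ensemble, labels, groups))
```

Reporting accuracy per group rather than one overall number is what makes the effect described above visible: a change that barely moves the average can still move the minority-group score.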

Wei-Yin Ko, Daniel D'souza, Karina Nguyen, Randall Balestriero, Sara Hooker • 2023

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Classification | German Credit (test) | Accuracy | 52.7 | 16 |
| Fair Classification | German Credit (test) | Equal Opportunity Difference | 36.5 | 15 |
| Classification | ACSEmployment CT (test) | AV.ACC | 53.9 | 14 |
| Employment Prediction | ACSEmployment OR (Oregon) (test) | AV.ACC | 48.8 | 14 |
| Income Prediction | ACSIncome (state RI) | Average Accuracy (AV.ACC) | 53.7 | 14 |
| Tabular Classification | ACSIncome state VT | Average Accuracy | 61.4 | 14 |
| Classification | ACSIncome state RI (test) | Avg Accuracy | 50.1 | 14 |
| Classification | ACSIncome state AZ | Avg Acc | 46.5 | 14 |
| Employment Prediction | ACSEmployment state LA (test) | AV.ACC | 49.9 | 14 |
| Employment Prediction | ACSEmployment MI (test) | AV.ACC | 46.1 | 14 |
Showing 10 of 12 rows
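
The Equal Opportunity Difference reported for German Credit is commonly defined as the gap in true positive rate between two groups. The sketch below follows that standard definition and takes the absolute value. This page does not say whether the listed value is signed, absolute, or scaled to a percentage, so treat those as assumptions. The function names and toy data are hypothetical.

```python
def true_positive_rate(preds, labels, positive=1):
    """Fraction of truly positive examples that were predicted positive."""
    hits = [p == positive for p, y in zip(preds, labels) if y == positive]
    if not hits:
        raise ValueError("no positive examples in this group")
    return sum(hits) / len(hits)


def equal_opportunity_difference(preds, labels, groups, group_a, group_b,
                                 positive=1):
    """Absolute TPR gap between two groups (absolute value is an assumption)."""
    def group_tpr(g):
        selected = [(p, y) for p, y, gg in zip(preds, labels, groups) if gg == g]
        return true_positive_rate(
            [p for p, _ in selected], [y for _, y in selected], positive
        )
    return abs(group_tpr(group_a) - group_tpr(group_b))


# Made-up toy data: group "a" has TPR 2/3, group "b" has TPR 1/2.
toy_preds = [1, 1, 0, 1, 0, 0]
toy_labels = [1, 1, 1, 1, 1, 0]
toy_groups = ["a", "a", "a", "b", "b", "b"]

eod = equal_opportunity_difference(toy_preds, toy_labels, toy_groups, "a", "b")
print(f"Equal Opportunity Difference: {eod:.4f}")
```

A value of 0 means both groups have the same true positive rate; larger values indicate a wider gap.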
