Optimising Equal Opportunity Fairness in Model Training
About
Real-world datasets often encode stereotypes and societal biases. Such biases can be implicitly captured by trained models, leading to biased predictions and exacerbating existing societal preconceptions. Existing debiasing methods, such as adversarial training and removing protected information from representations, have been shown to reduce bias. However, a disconnect between fairness criteria and training objectives makes it difficult to reason theoretically about the effectiveness of different techniques. In this work, we propose two novel training objectives which directly optimise for the widely-used criterion of *equal opportunity*, and show that they are effective in reducing bias while maintaining high performance over two classification tasks.
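The paper's two objectives are not reproduced here, but as a rough illustration of the general idea, the sketch below adds a differentiable equal-opportunity penalty to a standard cross-entropy loss. Equal opportunity asks that the true positive rate be equal across protected groups; since the hard TPR is non-differentiable, the sketch uses each group's mean predicted positive-class probability on true positives as a soft surrogate. The function name `eo_regularised_loss`, this particular surrogate, and the weighting parameter `lam` are illustrative assumptions, not the paper's formulation:

```python
import torch
import torch.nn.functional as F

def eo_regularised_loss(logits, labels, groups, lam=1.0):
    """Cross-entropy plus an equal-opportunity gap penalty (illustrative sketch).

    logits: (N, 2) binary-classification logits
    labels: (N,) gold labels in {0, 1}
    groups: (N,) protected-group ids
    lam:    trade-off weight between task loss and fairness penalty
    """
    ce = F.cross_entropy(logits, labels)
    probs = torch.softmax(logits, dim=-1)[:, 1]  # P(y = 1 | x)
    pos = labels == 1                            # condition on the positive class

    # Soft surrogate for the per-group TPR gap: the absolute difference
    # between each group's mean positive-class probability on true
    # positives and the overall mean over all true positives.
    penalty = logits.new_zeros(())
    if pos.any():
        overall = probs[pos].mean()
        for g in groups[pos].unique():
            mask = pos & (groups == g)
            penalty = penalty + (probs[mask].mean() - overall).abs()

    return ce + lam * penalty
```

Increasing `lam` trades task performance for a smaller per-group TPR gap; the paper's contribution is objectives that target equal opportunity directly, rather than a generic surrogate such as this one.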
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Multi Target-Group Identification | HateXplain (test) | African Group BA: 75.92 | 3 |
| Target Detection | MHS Corpus (test) | Performance (Asian): 82.51 | 3 |
| Multi-Label Classification | MHS Corpus | Macro Precision: 54.18 | 3 |
| Multi-Label Classification | HateXplain | Precision (macro): 0.5789 | 3 |
| Multi-Label Classification | MHS Corpus (test) | Hamming Loss: 12.1 | 3 |
| Multi-Label Classification | HateXplain (test) | Hamming Loss: 7.22 | 3 |