Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Debiasing Sentence Representations

About

As natural language processing methods are increasingly deployed in real-world scenarios such as healthcare, legal systems, and social science, it becomes necessary to recognize the role they potentially play in shaping social biases and stereotypes. Previous work has revealed the presence of social biases in widely used word embeddings involving gender, race, religion, and other social constructs. While some methods were proposed to debias these word-level embeddings, there is a need to perform debiasing at the sentence-level given the recent shift towards new contextualized sentence representations such as ELMo and BERT. In this paper, we investigate the presence of social biases in sentence-level representations and propose a new method, Sent-Debias, to reduce these biases. We show that Sent-Debias is effective in removing biases, and at the same time, preserves performance on sentence-level downstream tasks such as sentiment analysis, linguistic acceptability, and natural language understanding. We hope that our work will inspire future research on characterizing and removing social biases from widely adopted sentence representations for fairer NLP.

Paul Pu Liang, Irene Mengze Li, Emily Zheng, Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency• 2020

Related benchmarks

TaskDatasetResultRank
Counterfactual Input EvaluationCrowS-Pairs
SS42.14
33
Bias MeasurementStereoSet
Overall SS60.84
25
Occupation classificationBias-in-Bios
Accuracy (Overall)0.8356
18
Stereotype Bias EvaluationStereoSet Gender
LMS Score85.6
15
Gender bias evaluationSEAT
SEAT 60.336
13
Bias EvaluationCrowS-Pairs
CS Score52.29
13
Stereotypical Bias EvaluationStereoSet (dev)
Overall LMS Score84.166
12
Bias EvaluationCrow-S
Score53.817
9
Natural Language InferenceQNLI (test)
SEAT e-size (Names: Career/Family)0.05
8
Sentiment AnalysisSST-2 (test)
SEAT e-size: Names (Career/Family)0.1
8
Showing 10 of 11 rows

Other info

Follow for update