NeuTral Rewriter: A Rule-Based and Neural Approach to Automatic Rewriting into Gender-Neutral Alternatives
About
Recent years have seen an increasing need for gender-neutral and inclusive language. Within the field of NLP, there are various mono- and bilingual use cases where gender inclusive language is appropriate, if not preferred due to ambiguity or uncertainty in terms of the gender of referents. In this work, we present a rule-based and a neural approach to gender-neutral rewriting for English along with manually curated synthetic data (WinoBias+) and natural data (OpenSubtitles and Reddit) benchmarks. A detailed manual and automatic evaluation highlights how our NeuTral Rewriter, trained on data generated by the rule-based approach, obtains word error rates (WER) below 0.18% on synthetic, in-domain and out-domain test sets.
Eva Vanmassenhove, Chris Emmery, Dimitar Shterionov• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Gender-fair rewriting | OpenSubtitles (test) | Tokenised WER0.43 | 4 | |
| Gender-fair rewriting | Reddit (test) | Tokenised WER0.75 | 4 | |
| Gender-fair rewriting | WinoBias+ (test) | Tokenised WER0.09 | 4 |
Showing 3 of 3 rows