Fairness-Aware Multi-Group Target Detection in Online Discussion
About
Target-group detection is the task of detecting which group(s) a piece of content is ``directed at or about''. Applications include targeted marketing, content recommendation, and group-specific content assessment. Key challenges include: 1) that a single post may target multiple groups; and 2) ensuring consistent detection accuracy across groups for fairness. In this work, we investigate fairness implications of target-group detection in the context of toxicity detection, where the perceived harm of a social media post often depends on which group(s) it targets. Because toxicity is highly contextual, language that appears benign in general can be harmful when targeting specific demographic groups. We show our {\em fairness-aware multi-group target detection} approach both reduces bias across groups and shows strong predictive performance, surpassing existing fairness-aware baselines. To enable reproducibility and spur future work, we share our code online.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Label Classification | MHS Corpus | Macro Precision78.54 | 3 | |
| Multi-Label Classification | HateXplain | Precision (macro)0.7519 | 3 | |
| Multi-Label Classification | MHS Corpus (test) | Hamming Loss6.85 | 3 | |
| Multi-Label Classification | HateXplain (test) | Hamming Loss5.89 | 3 | |
| Target Detection | MHS Corpus (test) | Performance (Asian)83.18 | 3 | |
| multi target-group identification | HateXplain (test) | African Group BA74.29 | 3 |