Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Focal Inferential Infusion Coupled with Tractable Density Discrimination for Implicit Hate Detection

About

Although pretrained large language models (PLMs) have achieved state-of-the-art on many natural language processing (NLP) tasks, they lack an understanding of subtle expressions of implicit hate speech. Various attempts have been made to enhance the detection of implicit hate by augmenting external context or enforcing label separation via distance-based metrics. Combining these two approaches, we introduce FiADD, a novel Focused Inferential Adaptive Density Discrimination framework. FiADD enhances the PLM finetuning pipeline by bringing the surface form/meaning of an implicit hate speech closer to its implied form while increasing the inter-cluster distance among various labels. We test FiADD on three implicit hate datasets and observe significant improvement in the two-way and three-way hate classification tasks. We further experiment on the generalizability of FiADD on three other tasks, detecting sarcasm, irony, and stance, in which surface and implied forms differ, and observe similar performance improvements. Consequently, we analyze the generated latent space to understand its evolution under FiADD, which corroborates the advantage of employing FiADD for implicit hate speech detection.

Sarah Masud, Ashutosh Bajpai, Tanmoy Chakraborty• 2023

Related benchmarks

TaskDatasetResultRank
Three-way hate classificationLatentHatred
Macro F10.5607
6
Three-way hate classificationImpGab
Macro F148.13
6
Three-way hate classificationAbuseEval
Macro F153.98
6
Two-way hate classificationLatentHatred
Macro F10.7135
6
Two-way hate classificationImpGab
Macro F170.27
6
Two-way hate classificationAbuseEval
Macro F10.7202
6
Irony DetectionIrony dataset
Macro F170.99
4
Sarcasm DetectionSarcasm Dataset
Macro F158.14
4
Stance DetectionStance dataset
Macro F157.76
4
Showing 9 of 9 rows

Other info

Follow for update