Abuse is Contextual, What about NLP? The Role of Context in Abusive Language Annotation and Detection

About

The datasets most widely used for abusive language detection contain lists of messages, usually tweets, that have been manually judged as abusive or not by one or more annotators, with the annotation performed at message level. In this paper, we investigate what happens when the hateful content of a message is judged also based on the context, given that messages are often ambiguous and need to be interpreted in the context of occurrence. We first re-annotate part of a widely used dataset for abusive language detection in English in two conditions, i.e. with and without context. Then, we compare the performance of three classification algorithms obtained on these two types of dataset, arguing that a context-aware classification is more challenging but also more similar to a real application scenario.

Stefano Menini, Alessio Palmero Aprosio, Sara Tonelli• 2021

Related benchmarks

Task	Dataset	Result
Toxicity Detection	CAD (full)	F1 Score50.9	11
Toxicity Detection	BBF	F1 Score62.3	11
Toxicity Detection	BAD	F1 Score74.7	11
Toxicity Detection	HQG	F1 Score90.7	10
Toxicity Detection	HQR	F1 Score79.9	10
Toxicity Detection	CAD context	F1 Score55.3	10
Toxicity Detection	FBK full	F1 Score39	10
Toxicity Detection	FBK flipped	F1 Score19.2	10

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord