Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Abuse is Contextual, What about NLP? The Role of Context in Abusive Language Annotation and Detection

About

The datasets most widely used for abusive language detection contain lists of messages, usually tweets, that have been manually judged as abusive or not by one or more annotators, with the annotation performed at message level. In this paper, we investigate what happens when the hateful content of a message is judged also based on the context, given that messages are often ambiguous and need to be interpreted in the context of occurrence. We first re-annotate part of a widely used dataset for abusive language detection in English in two conditions, i.e. with and without context. Then, we compare the performance of three classification algorithms obtained on these two types of dataset, arguing that a context-aware classification is more challenging but also more similar to a real application scenario.

Stefano Menini, Alessio Palmero Aprosio, Sara Tonelli• 2021

Related benchmarks

TaskDatasetResultRank
Toxicity DetectionCAD (full)
F1 Score50.9
11
Toxicity DetectionBBF
F1 Score62.3
11
Toxicity DetectionBAD
F1 Score74.7
11
Toxicity DetectionHQG
F1 Score90.7
10
Toxicity DetectionHQR
F1 Score79.9
10
Toxicity DetectionCAD context
F1 Score55.3
10
Toxicity DetectionFBK full
F1 Score39
10
Toxicity DetectionFBK flipped
F1 Score19.2
10
Showing 8 of 8 rows

Other info

Follow for update