
UoT-UWF-PartAI at SemEval-2021 Task 5: Self Attention Based Bi-GRU with Multi-Embedding Representation for Toxicity Highlighter

About

The Toxic Spans Detection (TSD) task is defined as highlighting the spans that make a text toxic. Much prior work classifies a given comment or document as toxic or non-toxic, but none of those models operates at the token level. In this paper, we propose a self-attention-based bidirectional gated recurrent unit (BiGRU) with a multi-embedding representation of the tokens. The proposed model enriches the token representation by combining GPT-2, GloVe, and RoBERTa embeddings, which led to promising results. Experimental results show that the proposed approach is effective at detecting toxic span tokens.

Hamed Babaei Giglou, Taher Rahgooy, Mostafa Rahgouy, Jafar Razmara • 2021
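
The abstract frames toxic span detection as a token-level problem, so a tagger's per-token decisions have to be mapped back onto the text as highlighted spans. The following is a minimal sketch of that post-processing step only, not the authors' pipeline. It assumes whitespace tokenization, binary labels (1 = toxic), and character offsets as the output format. The function names are hypothetical, and a real system would also need to align subword tokens from GPT-2 and RoBERTa.

```python
import re
from typing import List, Tuple


def tokenize_with_offsets(text: str) -> List[Tuple[str, int, int]]:
    """Split on whitespace, keeping each token's [start, end) character range."""
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"\S+", text)]


def labels_to_char_offsets(text: str, token_labels: List[int]) -> List[int]:
    """Expand per-token 0/1 labels into a sorted list of toxic character offsets."""
    tokens = tokenize_with_offsets(text)
    if len(tokens) != len(token_labels):
        raise ValueError("expected exactly one label per whitespace token")
    offsets: List[int] = []
    previous_end = None  # end offset of the preceding token if it was toxic
    for (_, start, end), label in zip(tokens, token_labels):
        if label != 1:
            previous_end = None
            continue
        if previous_end is not None:
            # Assumption: whitespace between two adjacent toxic tokens
            # is treated as part of the same highlighted span.
            offsets.extend(range(previous_end, start))
        offsets.extend(range(start, end))
        previous_end = end
    return offsets


def offsets_to_spans(offsets: List[int]) -> List[Tuple[int, int]]:
    """Group sorted character offsets into contiguous [start, end) spans."""
    spans: List[Tuple[int, int]] = []
    for off in offsets:
        if spans and spans[-1][1] == off:
            spans[-1] = (spans[-1][0], off + 1)
        else:
            spans.append((off, off + 1))
    return spans
```

For example, with the text "you are a stupid idiot ok" and labels [0, 0, 0, 1, 1, 0], the two flagged tokens merge into a single highlighted span covering "stupid idiot". Because tokens are split on whitespace alone, trailing punctuation stays attached to its token in this sketch.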

Related benchmarks

Task                          Dataset                              Result                Rank
Toxic Comment Classification  Jigsaw Toxic Comment Classification  Accuracy 94.37        10
Toxicity Classification       Toxicity Dataset (test)              Test Accuracy 94.37   9
Toxicity Classification       Jigsaw (test)                        Accuracy 94.4         6
