UoT-UWF-PartAI at SemEval-2021 Task 5: Self Attention Based Bi-GRU with Multi-Embedding Representation for Toxicity Highlighter

About

Toxic Spans Detection(TSD) task is defined as highlighting spans that make a text toxic. Many works have been done to classify a given comment or document as toxic or non-toxic. However, none of those proposed models work at the token level. In this paper, we propose a self-attention-based bidirectional gated recurrent unit(BiGRU) with a multi-embedding representation of the tokens. Our proposed model enriches the representation by a combination of GPT-2, GloVe, and RoBERTa embeddings, which led to promising results. Experimental results show that our proposed approach is very effective in detecting span tokens.

Hamed Babaei Giglou, Taher Rahgooy, Mostafa Rahgouy, Jafar Razmara• 2021

Related benchmarks

Task	Dataset	Result
Toxic Comment Classification	Jigsaw Toxic Comment Classification	Accuracy94.37	10
Toxicity Classification	Toxicity Dataset (test)	Test Accuracy94.37	9
Toxicity Classification	Jigsaw (test)	Accuracy94.4	6

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord