UoT-UWF-PartAI at SemEval-2021 Task 5: Self Attention Based Bi-GRU with Multi-Embedding Representation for Toxicity Highlighter
About
Toxic Spans Detection(TSD) task is defined as highlighting spans that make a text toxic. Many works have been done to classify a given comment or document as toxic or non-toxic. However, none of those proposed models work at the token level. In this paper, we propose a self-attention-based bidirectional gated recurrent unit(BiGRU) with a multi-embedding representation of the tokens. Our proposed model enriches the representation by a combination of GPT-2, GloVe, and RoBERTa embeddings, which led to promising results. Experimental results show that our proposed approach is very effective in detecting span tokens.
Hamed Babaei Giglou, Taher Rahgooy, Mostafa Rahgouy, Jafar Razmara• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Toxic Comment Classification | Jigsaw Toxic Comment Classification | Accuracy94.37 | 10 | |
| Toxicity Classification | Toxicity Dataset (test) | Test Accuracy94.37 | 9 | |
| Toxicity Classification | Jigsaw (test) | Accuracy94.4 | 6 |
Showing 3 of 3 rows