Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Toxicity Classification benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Toxicity Classification
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
CivilComments sensitive attribute: MUSLIM (test)
T5TT-4-ERM
Balanced Accuracy
59.9
57
1mo ago
Average across WZ, DC, HX, OR
Ideal (Gold Label)
Harmonic F1
48.8
26
1mo ago
OR
Ideal (Gold Labels)
Harmonic F1
49.7
26
1mo ago
HX
Mistral-v0.3-FS
H.-F1
44.2
26
1mo ago
DC
ToxiGAN
Harmonic Mean F1
31
26
1mo ago
WZ
Back-Translate
Harmonic F1
73.4
26
1mo ago
ToxCMM
ToxVidLM
F1 Score
94.35
24
1mo ago
Toxigen
MAT-STEER
Accuracy
60.41
22
1mo ago
Personification GPT-3 prompted (test)
V-REx
Loss
0.69
16
1mo ago
RealToxicity Prompts GPT-3 prompted (test)
V-REx
Loss
0.61
16
1mo ago
CivilComments (CC) (test)
gDRO
Worst-Group Accuracy
79.66
13
1mo ago
ToxiCN (test)
MacBERT + ToxiTrace
Accuracy
83.87
12
4d ago
COLD (test)
RoBERTa + ToxiTrace
Accuracy
83.84
12
4d ago
Toxicity Dataset (test)
CoGate-LSTM
Test Accuracy
96
9
11d ago
Jigsaw (test)
CoGate-LSTM
Accuracy
96
6
11d ago
Civil reconstructed with controlled shortcut injection (test)
JTT
MSTPS
54.9
5
4d ago
Toxicity
AdvDemo + CW
Original Accuracy
90.4
4
1mo ago
Toxicity Classification (test)
Template Attack
ASR
88.4
4
1mo ago
English Civil Comments
PaLM 2
AUC-ROC
85.35
4
1mo ago
Multilingual Jigsaw
PaLM 2
French Accuracy
87.94
4
1mo ago
DICES 3-class (item-level)
DiADEM
Accuracy (Acc)
67.79
3
9d ago
VOICED item-level (binary)
DiADEM
Accuracy
80
3
9d ago
CivilComments
ERM
Average Accuracy
92.6
3
1mo ago
ImplicitHate
Llama-3.2-3B-Instruct (SRD)
Accuracy
83.97
2
1mo ago
Jigsaw-ML
No AT
AUC
98.4
2
1mo ago
Showing 25 of 28 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs