Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ToxiGen

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety EvaluationToxiGen
Safety93.1
71
Toxicity DetectionToxiGen
Score81.4
25
Toxicity GenerationToxiGen
ToxiGen Score1,633
24
Toxicity ClassificationToxigen
Accuracy60.41
22
HarmlessnessToxigen
Toxigen (%)100
17
DetoxificationToxiGen (test)
MTV97.4
16
Influence EstimationToxiGen (test)
Spearman Correlation0.44
14
Bias DetectionToxigen (test)
Accuracy90.3
12
Safety EvaluationToxiGen Pretrained Evaluation
Toxicity Rate14.53
12
Toxicity DetectionTOXIGEN (val)
AUC96
8
Misuse DetectionToxiGen Homophobia (external)
TPR98
1
Misuse DetectionToxiGen Ethnoracial (external)
TPR91
1
Detoxification Dataset Quality EvaluationToxiGen 500 neutral-toxic pairs
Overall O.2.475
1
Showing 13 of 13 rows