Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HateXplain

Benchmarks

Task NameDataset NameSOTA ResultTrend
Hate speech detectionHateXplain (test)
Macro F1 Score85.13
24
Hate speech classification and explainabilityHateXplain (test)
IOU F10.3176
22
Classification ProbingHateXplain (test)
Probe Accuracy (Best Layer)79.1
21
Toxicity DetectionHateXplain
AUC90.44
21
Hate Speech DetectionHateXplain (held-out)
F1 Score47.3
14
Hate Speech ClassificationHateXplain
Accuracy0.724
7
RationalizationHateXplain Synthetic Skew (test)
HI-F10.4281
7
ClassificationHateXplain Synthetic Skew (test)
Clf-F173.15
7
Hate Speech DetectionHateXplain Muslim-focused (test)
Accuracy75.1
7
Text ClassificationHateXplain (val)
Accuracy35.2
6
RationalizationHateXplain (test)
HI-F142.62
5
Hate Speech ClassificationHateXplain standard (test)
Accuracy68.6
5
Concept vector stabilityHateXplain
Mean Abs-Cosine Similarity0.98
3
Text ClassificationHateXplain (test)
Accuracy34.3
3
Showing 14 of 14 rows