Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HateXplain

Benchmarks

Task NameDataset NameSOTA ResultTrend
Hate speech detectionHateXplain (test)
Macro F1 Score85.13
36
Hate speech classification and explainabilityHateXplain (test)
IOU F10.3176
22
Classification ProbingHateXplain (test)
Probe Accuracy (Best Layer)79.1
21
Toxicity DetectionHateXplain
AUC90.44
21
Hate Speech DetectionHateXplain (held-out)
F1 Score47.3
14
Hate Speech ClassificationHateXplain
Accuracy0.724
7
RationalizationHateXplain Synthetic Skew (test)
HI-F10.4281
7
ClassificationHateXplain Synthetic Skew (test)
Clf-F173.15
7
Hate Speech DetectionHateXplain Muslim-focused (test)
Accuracy75.1
7
Text ClassificationHateXplain (val)
Accuracy35.2
6
RationalizationHateXplain (test)
HI-F142.62
5
Hate Speech ClassificationHateXplain standard (test)
Accuracy68.6
5
Multi-label classificationHateXplain (test)
Hamming Loss5.89
3
Multi-label classificationHateXplain
Precision (macro)0.7519
3
multi target-group identificationHateXplain (test)
African Group BA76.83
3
Concept vector stabilityHateXplain
Mean Abs-Cosine Similarity0.98
3
Text ClassificationHateXplain (test)
Accuracy34.3
3
Showing 17 of 17 rows