Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HateXplain

Benchmarks

Task NameDataset NameSOTA ResultTrend
Hate speech detectionHateXplain (test)
Macro F1 Score85.13
36
Feature AttributionHateXplain
Comprehensiveness85
33
Hate Speech DetectionHateXplain
F1-macro57
27
Hate speech classification and explainabilityHateXplain (test)
IOU F10.3176
22
Classification ProbingHateXplain (test)
Probe Accuracy (Best Layer)79.1
21
Hate Speech ClassificationHateXplain
Accuracy0.763
21
Toxicity DetectionHateXplain
AUC90.44
21
Hate Speech DetectionHateXplain (held-out)
F1 Score47.3
14
Hate speech classificationHateXplain (test)
Macro-F172
13
RationalizationHateXplain Synthetic Skew (test)
HI-F10.4281
7
ClassificationHateXplain Synthetic Skew (test)
Clf-F173.15
7
Hate Speech DetectionHateXplain Muslim-focused (test)
Accuracy75.1
7
Text ClassificationHateXplain (val)
Accuracy35.2
6
RationalizationHateXplain (test)
HI-F142.62
5
Hate Speech ClassificationHateXplain standard (test)
Accuracy68.6
5
Multi-label classificationHateXplain (test)
Hamming Loss5.89
3
Multi-label classificationHateXplain
Precision (macro)0.7519
3
multi target-group identificationHateXplain (test)
African Group BA76.83
3
Concept vector stabilityHateXplain
Mean Abs-Cosine Similarity0.98
3
Text ClassificationHateXplain (test)
Accuracy34.3
3
Showing 20 of 20 rows