Hate Speech Detection on HateXplain (held-out)
[Chart: F1 Score by date. Highlighted point: ExpNet, 47.3 F1 Score (Jan 20, 2026).]
Updated 4d ago
Evaluation Results
Method               Method details            Links     F1 Score
ExpNet               Training Data=SST-2 +...  2026.01   47.3
GradCAM              Training Data=SST-2 +...  2026.01   39.6
GAE                  Training Data=SST-2 +...  2026.01   39.1
MGAE                 Training Data=SST-2 +...  2026.01   39.1
LRP                  Training Data=SST-2 +...  2026.01   37.2
RawAt                Training Data=SST-2 +...  2026.01   36.2
Rollout              Training Data=SST-2 +...  2026.01   35.6
FullLRP              Training Data=SST-2 +...  2026.01   34.6
Integrated Gradient  Training Data=SST-2 +...  2026.01   34.5
AttCAT               Training Data=SST-2 +...  2026.01   34
CAM                  Training Data=SST-2 +...  2026.01   33.2
RandomBaseline       Training Data=SST-2 +...  2026.01   29.3
LIME                 Training Data=SST-2 +...  2026.01   29
SHAP                 Training Data=SST-2 +...  2026.01   27.6