Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Hate speech classification on Implicit Hate (test)
Loading...
0.67
Macro F1
Llama_Full
0.1292
0.2696
0.41
0.5504
Sep 18, 2025
Macro F1
F1 Score (%)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
F1 Score (%)
Llama_Full
Training Paradigm=Full...
2025.09
0.67
100
T5_Full
Training Paradigm=Full...
2025.09
0.64
96
ModernBERT
Training Paradigm=Full...
2025.09
0.64
96
SMARTER (Llama_DPO-256)
Training Paradigm=DPO,...
2025.09
0.6
90
SMARTER (T5_DPO-256)
Training Paradigm=DPO,...
2025.09
0.59
88
GPT-5-chat
Training Paradigm=Zero...
2025.09
0.58
87
GPT-4.1
Training Paradigm=Zero...
2025.09
0.57
85
Qwen-32B
Training Paradigm=Zero...
2025.09
0.49
73
Qwen-32B
Training Paradigm=16-s...
2025.09
0.48
72
GPT-4o-mini
Training Paradigm=Zero...
2025.09
0.42
63
GPT-5-chat
Training Paradigm=16-s...
2025.09
0.4
60
GPT-4.1
Training Paradigm=16-s...
2025.09
0.38
57
GPT-4o-mini
Training Paradigm=16-s...
2025.09
0.15
22
Feedback
Search any
task
Search any
task