Hate speech classification on Latent Hate (test)
[Chart: Macro F1 Score and F1 Score by date. Top entry: SMARTER (Llama_DPO-256), Macro F1 Score 69, Sep 18, 2025]
Updated 1mo ago
Evaluation Results
| Method | Method details | Links | Macro F1 Score | F1 Score |
|---|---|---|---|---|
| SMARTER (Llama_DPO-256) | Training Paradigm=DPO,... | 2025.09 | 69 | 100 |
| SMARTER (T5_DPO-256) | Training Paradigm=DPO,... | 2025.09 | 65 | 94 |
| GPT-4.1 | Training Paradigm=16-s... | 2025.09 | 63 | 91 |
| Llama_Full | Training Paradigm=Full... | 2025.09 | 62 | 90 |
| T5_Full | Training Paradigm=Full... | 2025.09 | 61 | 88 |
| ModernBERT | Training Paradigm=Full... | 2025.09 | 61 | 88 |
| GPT-4.1 | Training Paradigm=Zero... | 2025.09 | 60 | 87 |
| GPT-5-chat | Training Paradigm=16-s... | 2025.09 | 60 | 87 |
| Qwen-32B | Training Paradigm=16-s... | 2025.09 | 57 | 83 |
| GPT-4o-mini | Training Paradigm=Zero... | 2025.09 | 54 | 78 |
| GPT-5-chat | Training Paradigm=Zero... | 2025.09 | 51 | 74 |
| Qwen-32B | Training Paradigm=Zero... | 2025.09 | 47 | 68 |
| GPT-4o-mini | Training Paradigm=16-s... | 2025.09 | 25 | 36 |