Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hate Speech Classification on Internal Set
Loading...
97
Precision
GPT-4o
30.44
47.72
65
82.28
Dec 19, 2025
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
GPT-4o
2025.12
97
78
87
CoPE-A-9B
Parameters=9B
2025.12
93
87
90
CoPE-A-9B
Model size=9B
2025.12
89
93
91
LlamaGuard3-8B
Model size=8B
2025.12
88
64
74
ShieldGemma-9B
Model size=9B
2025.12
68
98
80
GPT-4o
2025.12
64
89
75
Llama-3.1-8B
Model size=8B
2025.12
59
96
73
Llama-3.1-8B
Parameters=8B
2025.12
33
96
49
Feedback
Search any
task
Search any
task