Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hate Speech Classification on Ethos Benchmark
Loading...
91
Precision
GPT-4o
60.84
68.67
76.5
84.33
Dec 19, 2025
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
GPT-4o
2025.12
91
78
84
LlamaGuard3-8B
Model Size=8B
2025.12
87
79
83
ShieldGemma-9B
Model Size=9B
2025.12
82
82
82
CoPE-A-9B
Model Size=9B
2025.12
80
88
84
Llama-3.1-8B
Model Size=8B
2025.12
62
92
74
Feedback
Search any
task
Search any
task