Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Sexual Content Classification on Internal Set
Loading...
100
Precision
LlamaGuard3-8B
45.92
59.96
74
88.04
Dec 19, 2025
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1 Score
LlamaGuard3-8B
Model Size=8B
2025.12
100
43
60
CoPE-A-9B
Model Size=9B
2025.12
96
83
89
ShieldGemma-9B
Model Size=9B
2025.12
96
76
85
GPT-4o
2025.12
95
72
82
Llama-3.1-8B
Model Size=8B
2025.12
48
95
64
Feedback
Search any
task
Search any
task