Prompt Classification on OpenAI Moderation Text Prompt
[Figure: F1 Score chart. Best result: LlamaGuard2-8B, F1 Score 88.89; data point dated Dec 29, 2025.]
Evaluation Results
Method                    Date      F1 Score
LlamaGuard2-8B            2025.12   88.89
GPT4o-mini                2025.12   88.64
GPT-OSS-SafeGuard-20B     2025.12   87.42
LlamaGuard3-8B            2025.12   86.5
Gemini2.5-Flash           2025.12   86.1
LlamaGuard-7B             2025.12   85.05
LlamaGuard4-12B           2025.12   84.83
ProGuard-7B               2025.12   83.05
LlamaGuard3-11B-Vision    2025.12   82.71
GuardReasonerVL-7B        2025.12   78.74
ShieldGemma-9B            2025.12   78.69
GuardReasoner-8B          2025.12   77.67
ProGuard-3B               2025.12   77.02
WildGuard-7B              2025.12   70.43