Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety Moderation on GuarEval Response
Loading...
79.4
F1 Score (Safety Moderation)
GGuard
29.48
42.44
55.4
68.36
Dec 22, 2025
F1 Score (Safety Moderation)
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score (Safety Moderation)
GGuard
Type=Response
2025.12
79.4
ShieldGemma
Type=Response
2025.12
70.6
OAI Moderator
Type=Response
2025.12
57.1
Llama Guard
Type=Response
2025.12
55.6
WildGuard
Type=Response
2025.12
31.4
Feedback
Search any
task
Search any
task