Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety Moderation on Beaver Response
Loading...
84.4
F1 Score
WildGuard
64.224
69.462
74.7
79.938
Dec 22, 2025
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
WildGuard
Type=Response
2025.12
84.4
GGuard
Type=Response
2025.12
83
ShieldGemma
Type=Response
2025.12
77.1
Llama Guard
Type=Response
2025.12
71.8
OAI Moderator
Type=Response
2025.12
65
Feedback
Search any
task
Search any
task