Safety Moderation on GuarEval Response
[Chart: F1 Score (Safety Moderation) over time. Best result: GGuard, 79.4 (Dec 22, 2025).]
Evaluation Results
Method         Type      Date     F1 Score (Safety Moderation)
GGuard         Response  2025.12  79.4
ShieldGemma    Response  2025.12  70.6
OAI Moderator  Response  2025.12  57.1
Llama Guard    Response  2025.12  55.6
WildGuard      Response  2025.12  31.4