Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safety Moderation on Safe RLHF EN
Loading...
93
F1 Score
MD-Judge
78.44
82.22
86
89.78
Mar 17, 2026
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
F1 Score
MD-Judge
Size=7B
2026.03
93
Wildguard
Size=7B
2026.03
93
FanarGuard
Size=4B
2026.03
93
PolyGuard-Ministral
Size=8B
2026.03
90
PolyGuard-Qwen
Size=7B
2026.03
90
Llama-Guard-3
Size=8B
2026.03
89
PolyGuard-Smol
Size=0.5B
2026.03
84
ShieldGemma-2b
Size=2B
2026.03
79
Feedback
Search any
task
Search any
task