Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Full-response Safety Guardrail Classification on BeaverTails (test)
Loading...
84.3
F1 Score
Qwen3Guard-8B
67.036
71.518
76
80.482
Jun 1, 2026
F1 Score
Updated 1d ago
Evaluation Results
Method
Method
Links
F1 Score
Qwen3Guard-8B
2026.06
84.3
WildGuard-7B
2026.06
83.8
GPT-5.5
protocol=zero-shot
2026.06
81.8
SentGuard
backbone=Qwen3-4B-Inst...
2026.06
81.2
Gemini-3.5-Flash
protocol=zero-shot
2026.06
70.2
LlamaGuard4-12B
2026.06
69.8
LlamaGuard3-8B
2026.06
67.7
Feedback
Search any
task
Search any
task