Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Prompt classification on OverR
Loading...
41.9
F1 Score
YuFeng-XGuard
4.356
14.103
23.85
33.597
Jan 22, 2026
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
YuFeng-XGuard
Model Size=8B
2026.01
41.9
YuFeng-XGuard
Model Size=0.6B
2026.01
41.8
Llama3Guard
Model Size=8B
2026.01
36.7
Llama4Guard
Model Size=12B
2026.01
34
Qwen3Guard-Gen
Model Size=4B, Evaluat...
2026.01
22
Qwen3Guard-Gen
Model Size=8B, Evaluat...
2026.01
21.5
ShieldGemma
Model Size=9B
2026.01
15.4
Qwen3Guard-Gen
Model Size=0.6B, Evalu...
2026.01
12.6
NemotronReasoning
Model Size=4B
2026.01
10.6
PolyGuard
Model Size=7B
2026.01
10.3
Qwen3Guard-Gen
Model Size=8B, Evaluat...
2026.01
10
Qwen3Guard-Gen
Model Size=4B, Evaluat...
2026.01
9.7
WildGuard
Model Size=7B
2026.01
9.2
Qwen3Guard-Gen
Model Size=0.6B, Evalu...
2026.01
8.2
NemotronGuardV2
Model Size=8B
2026.01
6.6
GPT-OSS-SafeGuard
Model Size=20B
2026.01
5.8
Feedback
Search any
task
Search any
task