Prompt Classification on SEval
Best F1 Score: 92.5 (YuFeng-XGuard), Jan 22, 2026
[Chart: F1 Score over time]
Evaluation Results
Method             Details                     Date     F1 Score
YuFeng-XGuard      Model Size=8B               2026.01  92.5
YuFeng-XGuard      Model Size=0.6B             2026.01  91.7
Qwen3Guard-Gen     Model Size=4B, Evaluat...   2026.01  87
Qwen3Guard-Gen     Model Size=8B, Evaluat...   2026.01  86.6
PolyGuard          Model Size=7B               2026.01  86.3
GPT-OSS-SafeGuard  Model Size=20B              2026.01  85.4
Qwen3Guard-Gen     Model Size=0.6B, Evalu...   2026.01  85.1
NemotronReasoning  Model Size=4B               2026.01  83.5
WildGuard          Model Size=7B               2026.01  79.8
NemotronGuardV2    Model Size=8B               2026.01  77.7
Qwen3Guard-Gen     Model Size=0.6B, Evalu...   2026.01  74.6
Qwen3Guard-Gen     Model Size=4B, Evaluat...   2026.01  73.2
Qwen3Guard-Gen     Model Size=8B, Evaluat...   2026.01  72.7
Llama3Guard        Model Size=8B               2026.01  68
Llama4Guard        Model Size=12B              2026.01  60.4
ShieldGemma        Model Size=9B               2026.01  57.1