Prompt Classification on ToxiC
Best F1 Score: 81.9 (Qwen3Guard-Gen)
[Chart: F1 Score by date; Jan 22, 2026]
Evaluation Results
| Method | Configuration | Date | F1 Score |
|---|---|---|---|
| Qwen3Guard-Gen | Model Size=4B, Evaluat... | 2026.01 | 81.9 |
| Qwen3Guard-Gen | Model Size=8B, Evaluat... | 2026.01 | 81.3 |
| Qwen3Guard-Gen | Model Size=0.6B, Evalu... | 2026.01 | 76.5 |
| YuFeng-XGuard | Model Size=0.6B | 2026.01 | 74.4 |
| NemotronGuardV2 | Model Size=8B | 2026.01 | 73 |
| NemotronReasoning | Model Size=4B | 2026.01 | 72.4 |
| ShieldGemma | Model Size=9B | 2026.01 | 72 |
| YuFeng-XGuard | Model Size=8B | 2026.01 | 71.1 |
| GPT-OSS-SafeGuard | Model Size=20B | 2026.01 | 71 |
| PolyGuard | Model Size=7B | 2026.01 | 68.8 |
| WildGuard | Model Size=7B | 2026.01 | 64.1 |
| Qwen3Guard-Gen | Model Size=8B, Evaluat... | 2026.01 | 63.7 |
| Qwen3Guard-Gen | Model Size=4B, Evaluat... | 2026.01 | 63.4 |
| Qwen3Guard-Gen | Model Size=0.6B, Evalu... | 2026.01 | 58.9 |
| Llama3Guard | Model Size=8B | 2026.01 | 48.4 |
| Llama4Guard | Model Size=12B | 2026.01 | 42.6 |