Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Prompt Classification on XSTest Text Prompt
Loading...
93.71
F1 Score
GuardReasoner-8B
78.214
82.237
86.26
90.283
Dec 29, 2025
F1 Score
Updated 2d ago
Evaluation Results
Method
Method
Links
F1 Score
GuardReasoner-8B
2025.12
93.71
Gemini2.5-Flash
2025.12
90.91
GPT-OSS-SafeGuard-20B
2025.12
90.81
GuardReasonerVL-7B
2025.12
90.59
LlamaGuard2-8B
2025.12
90.57
LlamaGuard3-8B
2025.12
89.98
ProGuard-3B
2025.12
88.25
ProGuard-7B
2025.12
87.9
LlamaGuard3-11B-Vision
2025.12
87.19
LlamaGuard4-12B
2025.12
86.64
GPT4o-mini
2025.12
85.98
LlamaGuard-7B
2025.12
83.42
WildGuard-7B
2025.12
81.91
ShieldGemma-9B
2025.12
78.81
Feedback
Search any
task
Search any
task