Response Classification on Aegis Text Response 2.0
Best F1 Score: 82.27 (ProGuard-7B, Dec 29, 2025)
Updated 3d ago
Evaluation Results
Method                    Date      F1 Score
ProGuard-7B               2025.12   82.27
MD-Judge-7B               2025.12   80.57
ProGuard-3B               2025.12   80.51
Gemini2.5-Flash           2025.12   80.44
DynaGuard-8B              2025.12   80.34
GPT4o-mini                2025.12   79.22
GuardReasoner-8B          2025.12   79.13
GPT-OSS-SafeGuard-20B     2025.12   77.66
GuardReasonerVL-7B        2025.12   74.64
LlamaGuard2-8B            2025.12   73.53
LlamaGuard-7B             2025.12   72.58
LlamaGuard3-8B            2025.12   72.33
LlamaGuard4-12B           2025.12   71.82
WildGuard-7B              2025.12   71.45
LlamaGuard3-11B-Vision    2025.12   69.63
ShieldGemma-9B            2025.12   64.21