Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Binary Safety Classification on wildguard prompt safety
Loading...
97.91
Macro F1
Opir-multitask-large
71.6084
78.4367
85.265
92.0933
May 28, 2026
Macro F1
Updated 5d ago
Evaluation Results
Method
Method
Links
Macro F1
Opir-multitask-large
2026.05
97.91
Opir-edge-multilang
2026.05
94.86
Qwen3Guard-Gen-8B
2026.05
90.95
WildGuard (vLLM)
2026.05
90.37
PolyGuard-Qwen
2026.05
90.29
Opir-edge
2026.05
89.88
PolyGuard-Qwen-Smol
2026.05
89
Opir-multitask-multilang
2026.05
88.84
GLiGuard-LLMGuardrails-300M
2026.05
87.28
Nemotron Safety Guard v3
2026.05
85.94
Gliner-Guard-Omni
2026.05
72.62
Feedback
Search any
task
Search any
task