Safety Classification on ToxicChat jailbreaking
Top result: Gliner-Guard-Omni, Macro F1 70.54
[Chart: Macro F1 by date; x-axis tick: May 28, 2026; y-axis: Macro F1]
Updated 5d ago
Evaluation Results
Method                         Date      Macro F1
Gliner-Guard-Omni              2026.05   70.54
Opir-multitask-large           2026.05   66.34
Nemotron Safety Guard v3       2026.05   63.23
PolyGuard-Qwen                 2026.05   58.45
PolyGuard-Qwen-Smol            2026.05   57.86
WildGuard (vLLM)               2026.05   57.13
Qwen3Guard-Gen-8B              2026.05   57.01
GLiGuard-LLMGuardrails-300M    2026.05   43.57
Opir-edge-multilang            2026.05   39.51
Opir-multitask-multilang       2026.05   19.3
Opir-edge                      2026.05   4.32