Share your thoughts, 1 month free Claude Pro on usSee more

Jailbreak Detection on JailBreakBench Single Turn 35

98F1 Score

DeepContext

Updated 5mo ago

Evaluation Results

Method	Links
DeepContext 2026.02		98	100	95
Qwen3Guard-Gen 2026.02		88	95	83
Llama-Guard-4 2026.02		86	86	86
Gpt5 2026.02		83	91	76
GCP Model Armor 2026.02		83	96	74
Granite-Guardian-3.3 2026.02		78	100	65
Llama-Prompt-Guard-2 2026.02		59	50	72
Deberta-v3-Prompt-Injection 2026.02		54	57	50
AWS Prompt Attack Guardrails 2026.02		8	4	100
Azure Prompt Shield 2026.02		0	0	0