
Jailbreak evaluation on illicit non-violent crime prompts

99.5 Not Unsafe Rate

gpt-5-thinking

[Chart: Not Unsafe Rate, y-axis ticks 93.156 to 98.097; x-axis label Dec 19, 2025]
Updated 4d ago

Evaluation Results

Method | Links
99.5
2025.12
98.5
2025.12
93.7
2025.12
93.4