Jailbreak evaluation on illicit non-violent crime prompts

99.5 Not Unsafe Rate

gpt-5-thinking

Dec 19, 2025
Updated 1mo ago

Evaluation Results

Method Links
99.5
2025.12
98.5
2025.12
93.7
2025.12
93.4