Jailbreak evaluation on illicit non-violent crime prompts

99.5 Not Unsafe Rate

gpt-5-thinking

Updated 4mo ago

Evaluation Results

Method	Links	Not Unsafe Rate
gpt-5-thinking 2025.12		99.5
OpenAI o3 2025.12		98.5
GPT-4o 2025.12		93.7
gpt-5-main 2025.12		93.4