Jailbreak evaluation on violence prompts

99.9 Non-Unsafe Rate

gpt-5-thinking

Updated 4mo ago

Evaluation Results

Method	Links	Non-Unsafe Rate
gpt-5-thinking 2025.12		99.9
OpenAI o3 2025.12		99.2
GPT-4o 2025.12		95.5
gpt-5-main 2025.12		94.8