Share your thoughts, 1 month free Claude Pro on usSee more

Jailbreak evaluation on abuse, disinformation, hate prompts

99.9Not Unsafe Rate

gpt-5-thinking

Updated 4mo ago

Evaluation Results

Method	Links
gpt-5-thinking 2025.12		99.9
OpenAI o3 2025.12		99.5
GPT-4o 2025.12		98.1
gpt-5-main 2025.12		97.8