Phrase Protection on Instruction Hierarchy (test)

User Message Protection Accuracy: 97.5

OpenAI o3

Updated 5mo ago

Evaluation Results

Method	Links	User Message Protection Accuracy	(unlabeled)
OpenAI o3 2025.12		97.5	92.1
gpt-5-thinking 2025.12		94	91.1
GPT-4o 2025.12		73.5	44.9
gpt-5-main 2025.12		61.9	40.4