Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Phrase Protection on Instruction Hierarchy (test)

97.5User Message Protection Accuracy

OpenAI o3

60.47670.08879.789.312Dec 19, 2025
Updated 3mo ago

Evaluation Results

MethodLinks
2025.12
97.592.1
9491.1
2025.12
73.544.9
2025.12
61.940.4