Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Instruction Hierarchy Robustness on Human Red-teaming IH-Challenge

271Number of Tasks

GPT-5-Mini

264.76266.38268269.62Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
27132.8436.21.5
2026.03
26843.087.10.2
2026.03
26552.3911.70.4