Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Prompt Injection Detection on IHEval Rule-following

0.01FPR

AlignSentinel (Enc-first)

0.00720.02610.0450.0639Feb 14, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
0.010.1
2026.02
0.020.16
2026.02
0.030.04
2026.02
0.060.09
2026.02
0.070.1
2026.02
0.080.14