Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Policy Violation Detection on PhantomPolicy safe-control original

0.0333Violation Rate

GPT-5.4

0.0293330.0563330.0833330.110333Apr 14, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
0.0333
2026.04
0.0333
2026.04
0.05
2026.04
0.05
2026.04
0.1333