Direct Prompt Injection robustness on Agent Security Bench DPI

19 ASR

Phi-4

Mar 3, 2026
Updated 1mo ago

Evaluation Results

Method, Links
2026.03
1981
2026.03
2179
2026.03
2667
2026.03
2872
2026.03
2971
2026.03
420
2026.03
4258
2026.03
4646
2026.03
5542
2026.03
760