
Malicious Prompt Detection on MMBench OOD

0.16 FPR

VLMShield

[Chart: FPR over time; date label Apr 7, 2026]
Updated 9d ago

Evaluation Results

Date      FPR
2026.04   0.16
2026.04   0.16
2026.04   2
2026.04   2.34
2026.04   2.54
2026.04   2.54
2026.04   3.08
2026.04   4.93
2026.04   5
2026.04   5.36
2026.04   7.2
2026.04   8.75
2026.04   9.83
2026.04   9.83