Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Malicious Prompt Detection on JailbreakV_28K Text-based (test)

0FNR

VLMShield

-2.834816.300135.43554.5699Apr 7, 2026
Updated 9d ago

Evaluation Results

MethodLinks
2026.04
0
2026.04
0
2026.04
1.72
2026.04
3.88
2026.04
5.72
2026.04
8.4
2026.04
9.26
2026.04
16.18
2026.04
20.65
2026.04
20.65
2026.04
22.83
2026.04
26.33
2026.04
28.06
2026.04
48.53
2026.04
48.53
2026.04
70.87