Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Malicious Prompt Detection on JailbreakV_28K Image-based (test)
Loading...
0.19
FNR
VLMShield
-3.004
18.5555
40.115
61.6745
Apr 7, 2026
FNR
Updated 9d ago
Evaluation Results
Method
Method
Links
FNR
VLMShield
Target VLM Architectur...
2026.04
0.19
VLMShield
Target VLM Architectur...
2026.04
0.19
ASTRA
Target VLM Architectur...
2026.04
2.14
ASTRA
Target VLM Architectur...
2026.04
5.21
VLMGuard
Target VLM Architectur...
2026.04
11.82
JailGuard
Target VLM Architectur...
2026.04
14
VLMGuard
Target VLM Architectur...
2026.04
16.37
MirrorCheck
Target VLM Architectur...
2026.04
17.19
MirrorCheck
Target VLM Architectur...
2026.04
17.19
JailGuard
Target VLM Architectur...
2026.04
22.05
SelfReminder
Target VLM Architectur...
2026.04
34.8
CIDER
Target VLM Architectur...
2026.04
37.2
CIDER
Target VLM Architectur...
2026.04
37.2
ECSO
Target VLM Architectur...
2026.04
39.68
ECSO
Target VLM Architectur...
2026.04
43.06
SelfReminder
Target VLM Architectur...
2026.04
80.04
Feedback
Search any
task
Search any
task