Jailbreak Attack Success Evaluation on RedTeam2K SD+TYPO
[Chart: Attack Success Rate (ASR). Plotted entry: CMRM, 51, Mar 18, 2026.]
Evaluation Results
| Method | VLM | Date | Attack Success Rate (ASR) |
|---|---|---|---|
| CMRM | LLaVA-1.5-7B | 2026.03 | 51 |
| LLaVA-1.5-7B (Baseline) | LLaVA-1.5-7B | 2026.03 | 49.8 |
| InternVL-Chat-19B (Baseline) | InternVL-Chat-19B | 2026.03 | 43.6 |
| ShiftDC | LLaVA-1.5-7B | 2026.03 | 42.5 |
| ShareGPT4V-7B (Baseline) | ShareGPT4V-7B | 2026.03 | 39.2 |
| ECSO | LLaVA-1.5-7B | 2026.03 | 28.6 |
| ShiftDC | InternVL-Chat-19B | 2026.03 | 27.6 |
| ECSO | ShareGPT4V-7B | 2026.03 | 26.1 |
| AdaShield | LLaVA-1.5-7B | 2026.03 | 22.2 |
| JRS-Rem | LLaVA-1.5-7B | 2026.03 | 21.9 |
| CMRM | InternVL-Chat-19B | 2026.03 | 21.4 |
| ShiftDC | ShareGPT4V-7B | 2026.03 | 20.1 |
| ECSO | InternVL-Chat-19B | 2026.03 | 19.8 |
| AdaShield | InternVL-Chat-19B | 2026.03 | 19.6 |
| AdaShield | ShareGPT4V-7B | 2026.03 | 15 |
| CMRM | ShareGPT4V-7B | 2026.03 | 12.1 |
| JRS-Rem | InternVL-Chat-19B | 2026.03 | 9.6 |
| JRS-Rem | ShareGPT4V-7B | 2026.03 | 9.1 |