Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safety Evaluation on APGD-JailBreak
Loading...
3.31
ASR (Unconstrained)
CARE
0.558
19.134
37.71
56.286
Mar 28, 2026
ASR (Unconstrained)
ASR (kappa=16/255)
ASR (kappa=32/255)
ASR (kappa=64/255)
Updated 19d ago
Evaluation Results
Method
Method
Links
ASR (Unconstrained)
ASR (kappa=16/255)
ASR (kappa=32/255)
ASR (kappa=64/255)
CARE
Backbone=Qwen2.5-VL
2026.03
3.31
11.17
8.82
6.13
CARE
Backbone=LLaVA-OneVision
2026.03
6.37
10.75
8.23
8.18
ASTRA
Backbone=LLaVA-OneVision
2026.03
7.26
13.3
12.13
10.24
ASTRA
Backbone=Qwen2.5-VL
2026.03
9.47
17.54
13.35
12.26
Original model
Backbone=Qwen2.5-VL
2026.03
69.37
60.63
64.45
73.32
Original model
Backbone=LLaVA-OneVision
2026.03
72.11
63.18
66.43
70.7
Feedback
Search any
task
Search any
task