Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Attack Success Rate Evaluation on 40 diverse smartphone tasks Generate subcategory
Loading...
75
Target ASR
HG-IDA*
-3
17.25
37.5
57.75
Oct 9, 2025
Target ASR
Response ASR
Updated 9d ago
Evaluation Results
Method
Method
Links
Target ASR
Response ASR
HG-IDA*
Target Model=GPT-4o
2025.10
75
50
HG-IDA*
Target Model=Gemini-2....
2025.10
75
75
HG-IDA*
Target Model=Deepseek-VL2
2025.10
75
25
DA
Target Model=Gemini-2....
2025.10
50
0
Prefix
Target Model=Gemini-2....
2025.10
25
0
GCG
Target Model=Gemini-2....
2025.10
25
25
GCG
Target Model=Deepseek-VL2
2025.10
25
25
DA
Target Model=LLaVA-One...
2025.10
25
25
GCG
Target Model=LLaVA-One...
2025.10
25
25
HG-IDA*
Target Model=LLaVA-One...
2025.10
25
25
DA
Target Model=GPT-4o
2025.10
0
0
Prefix
Target Model=GPT-4o
2025.10
0
0
GCG
Target Model=GPT-4o
2025.10
0
0
DA
Target Model=Deepseek-VL2
2025.10
0
0
Prefix
Target Model=Deepseek-VL2
2025.10
0
0
Prefix
Target Model=LLaVA-One...
2025.10
0
0
Feedback
Search any
task
Search any
task