Attack Success Rate Evaluation on 40 diverse smartphone tasks Execute
Figure: TASR and RASR over time (highlighted point: HG-IDA*, TASR 100, Oct 9, 2025)
Evaluation Results
Method    Target Model    Date      TASR   RASR
HG-IDA*   Gemini-2....    2025.10   100    100
HG-IDA*   Deepseek-VL2    2025.10   80     20
HG-IDA*   GPT-4o          2025.10   60     60
Prefix    Gemini-2....    2025.10   60     40
DA        Gemini-2....    2025.10   40     20
GCG       Gemini-2....    2025.10   40     40
GCG       LLaVA-One...    2025.10   40     40
HG-IDA*   LLaVA-One...    2025.10   40     40
DA        LLaVA-One...    2025.10   20     20
DA        GPT-4o          2025.10   0      0
Prefix    GPT-4o          2025.10   0      0
GCG       GPT-4o          2025.10   0      0
DA        Deepseek-VL2    2025.10   0      0
Prefix    Deepseek-VL2    2025.10   0      0
GCG       Deepseek-VL2    2025.10   0      0
Prefix    LLaVA-One...    2025.10   0      0