Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
GUI Action Prediction on Android Control Low
Loading...
94.1
TM
Qwen2.5-VL
63.94
71.77
79.6
87.43
Jun 25, 2025
TM
EM
Updated 1mo ago
Evaluation Results
Method
Method
Links
TM
EM
Qwen2.5-VL
Model Size=7B
2025.06
94.1
85
Aguvis
Model Size=7B
2025.06
93.9
89.4
Mobile-R1
Model Size=3B
2025.06
93.5
87.1
OS-Genesis
Model Size=7B
2025.06
90.7
74.2
OS-Atlas
Model Size=7B
2025.06
73
67.3
Odyssey
Model Size=7B
2025.06
65.1
39.2
Feedback
Search any
task
Search any
task