Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mobile GUI Automation on AMEX
Loading...
77.57
Type Success Rate
GUI-R1-7B + CES
54.2116
60.2758
66.34
72.4042
Nov 27, 2025
Type Success Rate
GR
SR
Updated 4d ago
Evaluation Results
Method
Method
Links
Type Success Rate
GR
SR
GUI-R1-7B + CES
Method=Multi-Agent
2025.11
77.57
61.64
48.48
GUI-R1-7B + GPT-5
Method=Multi-Agent
2025.11
72.8
52.15
35.8
GUI-R1-7B
Method=RL
2025.11
67.26
57.12
43.69
GUI-Owl-7B
Method=RL
2025.11
61.56
48.38
40.48
UI-R1-3B
Method=RL
2025.11
60.23
41.78
35.81
Qwen2.5-VL-7B
Method=Zero Shot
2025.11
59.52
48.24
35.1
OS-Atlas-7B
Method=SFT
2025.11
55.11
40.3
33.89
Feedback
Search any
task
Search any
task