Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mobile GUI Automation on AITZ
Loading...
64.44
Type Success Rate
GUI-R1-7B + CES
33.0216
41.1783
49.335
57.4917
Nov 27, 2025
Type Success Rate
Goal Rate
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Type Success Rate
Goal Rate
Success Rate
GUI-R1-7B + CES
Method=Multi-Agent
2025.11
64.44
64.58
43.05
GUI-R1-7B + GPT-5
Method=Multi-Agent
2025.11
62.5
59.1
40.55
GUI-Owl-7B
Method=RL
2025.11
53.86
52.08
32.7
GUI-R1-7B
Method=RL
2025.11
52.73
54.92
30.59
UI-R1-3B
Method=RL
2025.11
41.63
49.27
24.55
OS-Atlas-7B
Method=SFT
2025.11
38.52
44.14
25.97
Qwen2.5-VL-7B
Method=Zero Shot
2025.11
34.23
55.27
18.11
Feedback
Search any
task
Search any
task