Mobile GUI Automation on AW-Extend
[Chart: Success Rate over time. Top entry: AgentProg, 68.4 (Dec 11, 2025)]
Evaluation Results
Method                          Backbone            Date      Success Rate
AgentProg                       Gemini-2.5-Pr...    2025.12   68.4
UI-TARS (UI-TARS-1.5-API)       UI-TARS-1.5-API     2025.12   36.8
M3A (SoM)                       Gemini-2.5-Pr...    2025.12   28.9
Mobile-Agent-v3 (GUI-Owl-32B)   GUI-Owl-32B         2025.12   28.9
Mobile-Agent-v3 (GUI-Owl-7B)    GUI-Owl-7B          2025.12   26.3
M3A (a11y)                      Gemini-2.5-Pr...    2025.12   23.7