GUI Reasoning on AndroidControl Low
[Chart: Type score over time. Best result: CAPO, 82.29 (Dec 2, 2025). Selectable metrics: Type, GR, SR.]
Updated 3d ago
Evaluation Results
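| Method       | Setting             | Date    | Type  | GR    | SR    |
|--------------|---------------------|---------|-------|-------|-------|
| CAPO         | Prompting=zero-shot | 2025.12 | 82.29 | 81.19 | 61.41 |
| GRPO         | Prompting=zero-shot | 2025.12 | 82.13 | 80.15 | 63.87 |
| Os-Atlas-4B  | Prompting=zero-shot | 2025.12 | 64.58 | 71.19 | 40.62 |
| QwenVL2.5-3B | Prompting=zero-shot | 2025.12 | 62.03 | 74.07 | 59.32 |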