Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
GUI Task Execution on Amex
Loading...
67.45
Success Rate
Atlas-Pro-7B
48.6364
53.5207
58.405
63.2893
Jan 27, 2026
Success Rate
Grounding Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
Grounding Rate
Atlas-Pro-7B
Method Category=Specia...
2026.01
67.45
67.78
UI-Venus-Navi-7B
Method Category=Specia...
2026.01
63.58
79.67
MAGNET
Method Category=Agenti...
2026.01
62.84
71.53
MAGNET
Method Category=Agenti...
2026.01
62.23
75.54
InfiGUI-R1-3B
Method Category=Specia...
2026.01
62
69.98
COAT
Method Category=Agenti...
2026.01
59.68
67.69
Agent-S
Method Category=Agenti...
2026.01
58.29
69.85
GUI-R1-7B
Method Category=Specia...
2026.01
49.36
59.42
Feedback
Search any
task
Search any
task