GUI Agent Automation on MiniWoB++ (Online)
Top entry: VeriGUI-7B, Success Rate 59.7 (Apr 7, 2026). Updated 11d ago.
Evaluation Results
Method           Category          Date       Success Rate
VeriGUI-7B       Ours              2026.04    59.7
UI-S1-7B         Open-source 7B    2026.04    56.6
Qwen2.5-VL-7B    Open-source 7B    2026.04    47
VeriGUI-3B       Ours              2026.04    35.6
UI-R1-3B         Open-source 3B    2026.04    33.4
Qwen2.5-VL-3B    Open-source 3B    2026.04    21.2