Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Web Navigation on Mind2Web Service
Loading...
36.27
Success Rate
Mem-W-8B
5.6836
13.6243
21.565
29.5057
May 10, 2026
Success Rate
Updated 22d ago
Evaluation Results
Method
Method
Links
Success Rate
Mem-W-8B
Backbone=UI-Venus-1.5-8B
2026.05
36.27
GUI-Owl-1.5-8B
Model Class=Open-sourc...
2026.05
30.39
Qwen3-VL-32B
Model Class=Open-sourc...
2026.05
27.45
GLM-4.1V-9B-Thinking
Model Class=Open-sourc...
2026.05
26.47
Qwen3-VL-8B
Model Class=Open-sourc...
2026.05
26.47
Mem-W-4B
Backbone=Qwen3-VL-4B
2026.05
26.47
Qwen3-VL-2B
Model Class=Open-sourc...
2026.05
23.52
Mem-W-7B
Backbone=UI-TARS-1.5-7B
2026.05
22.55
MAI-UI-2B
Model Class=Open-sourc...
2026.05
21.57
Step-GUI-4B
Model Class=Open-sourc...
2026.05
21.56
Qwen2.5-VL-7B
Model Class=Open-sourc...
2026.05
17.65
UI-Venus-1.5-8B
Model Class=Mem-W Models
2026.05
15.69
Qwen3-VL-4B
Model Class=Mem-W Models
2026.05
14.71
UI-Venus-1.5-30B-A3B
Model Class=Open-sourc...
2026.05
8.82
UI-TARS-1.5-7B
Model Class=Mem-W Models
2026.05
6.86
Feedback
Search any
task
Search any
task