Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Web Navigation on Reddit
Loading...
6.7
Success Rate (SR)
GPT-4o
0.252
1.926
3.6
5.274
May 29, 2026
Success Rate (SR)
Average Steps (AS)
Updated 2d ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Average Steps (AS)
GPT-4o
Model=GPT-4o, Strategy...
2026.05
6.7
17.6
SCALE
Model=Qwen2.5-VL-7B, S...
2026.05
4.8
8.4
InternVL2.5-8B
Model=InternVL2.5-8B,...
2026.05
4.3
9.7
SCALE
Model=InternVL2.5-8B,...
2026.05
3.3
7.2
Qwen2.5-VL-7B
Model=Qwen2.5-VL-7B, S...
2026.05
3.3
12
InternVL2.5-8B
Model=InternVL2.5-8B,...
2026.05
3
17.2
ViGoRL
Model=ViGoRL, Strategy...
2026.05
2.9
14.6
Qwen2.5-VL-7B
Model=Qwen2.5-VL-7B, S...
2026.05
2.4
16.1
SCALE-20k
Model=LLaVA-NeXT-8B, S...
2026.05
1.9
9.6
InternVL2.5-8B
Model=InternVL2.5-8B,...
2026.05
1.4
22.7
Qwen2.5-VL-7B
Model=Qwen2.5-VL-7B, S...
2026.05
1.4
12.7
LLaVA-NeXT-8B
Model=LLaVA-NeXT-8B, S...
2026.05
1.4
4.6
InternVL2.5-8B
Model=InternVL2.5-8B,...
2026.05
1
-
Qwen2.5-VL-7B
Model=Qwen2.5-VL-7B, S...
2026.05
1
-
Augvis
Model=Augvis, Strategy...
2026.05
0.5
-
Feedback
Search any
task
Search any
task