Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Browser Use on WebTailBench
Loading...
63.5
Success Rate
Axtree Agent (Gemini-3-flash)
17.74
29.62
41.5
53.38
Apr 9, 2026
Success Rate
Updated 9d ago
Evaluation Results
Method
Method
Links
Success Rate
Axtree Agent (Gemini-3-flash)
Model Category=API onl...
2026.04
63.5
Gemini computer-use-preview
Model Category=API onl...
2026.04
63
Axtree Agent (Gemini-3-flash)
Model Category=API onl...
2026.04
62.1
SoM Agent (GPT-5)
Model Category=API onl...
2026.04
60.4
SoM Agent (o3)
Model Category=API onl...
2026.04
52.7
MolmoWeb-8B
Model Category=Open we...
2026.04
49.5
MolmoWeb-4B
Model Category=Open we...
2026.04
43.8
Fara-7B
Model Category=Open we...
2026.04
38.4
SoM Agent (GPT-4o)
Model Category=API onl...
2026.04
30.8
Axtree Agent (GPT-5)
Model Category=API onl...
2026.04
29.2
OpenAI computer-use-preview
Model Category=API onl...
2026.04
25.7
GLM-4.1V-9B-Thinking
Model Category=Open we...
2026.04
22.4
UI-TARS-1.5-7B
Model Category=Open we...
2026.04
19.5
Feedback
Search any
task
Search any
task