GUI Agent on WebVoyager
[Chart: Success Rate over time; top result AUTOMMEMO at 51.84, May 16, 2026]
Updated 15d ago
Evaluation Results
Method               Execution Model   Date      Success Rate
AUTOMMEMO            Qwen3-...         2026.05   51.84
ALMA                 Qwen3-...         2026.05   45.73
ReasoningBank        Qwen3-...         2026.05   45.36
TrajectoryRetrieval  Qwen3-...         2026.05   43.9
M2                   Qwen3-...         2026.05   43.59
AUTOMMEMO            GPT-5....         2026.05   42.3
XSkill               Qwen3-...         2026.05   41.04
M2                   GPT-5....         2026.05   38.33
NoMemory             GPT-5....         2026.05   37.89
ALMA                 GPT-5....         2026.05   37.47
G-Memory             Qwen3-...         2026.05   37.35
G-Memory             GPT-5....         2026.05   36.5
NoMemory             Qwen3-...         2026.05   36.32
ReasoningBank        GPT-5....         2026.05   35.51
TrajectoryRetrieval  GPT-5....         2026.05   34.82
XSkill               GPT-5....         2026.05   24.89