Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Web Interaction on Mind2Web cross-domain
Loading...
26.97
Action Accuracy
Evolving-RL
7.1476
12.2938
17.44
22.5862
May 11, 2026
Action Accuracy
Success Rate (SR)
Updated 21d ago
Evaluation Results
Method
Method
Links
Action Accuracy
Success Rate (SR)
Evolving-RL
Skill injection=true
2026.05
26.97
1.91
Evolving-RL
Skill injection=false
2026.05
24.67
1.21
GRPO
Skill injection=false
2026.05
20.84
1.67
GRPO
Skill injection=true
2026.05
20.33
1.69
Memento
2026.05
13.13
0.88
Base Model
Skill injection=true
2026.05
12.68
0.5
ReasoningBank
2026.05
12.09
0.66
Base Model
Skill injection=false
2026.05
7.91
0.44
Feedback
Search any
task
Search any
task