Web Task Completion on Gitlab
[Chart: Accuracy. Top result: JIT-Planner, 87, May 20, 2026. Updated 13d ago.]
Evaluation Results
Method          Configuration   Date      Accuracy
JIT-Planner     -               2026.05   87
OpenAI CUA      -               2026.05   80
Browser-Use     cache=true      2026.05   67
Browser-Use     cache=false     2026.05   49
Anthropic CUA   -               2026.05   47