GUI Automation on MiniWoB++ All - 53 types (test)
[Chart: Token Usage / Success Rate by date. Data point shown: AutoRPA (code only), Token Usage 900, May 20, 2026]
Updated 13d ago
Evaluation Results

| Method | LLM Backbone | Date | Token Usage | Success Rate |
| --- | --- | --- | --- | --- |
| AutoRPA (code only) | GPT-4.1 | 2026.05 | 900 | 92.5 |
| AutoRPA | GPT-4.1 | 2026.05 | 1,400 | 95.4 |
| AdaPlanner (one demo) | GPT-4.1,... | 2026.05 | 4,500 | 74.3 |
| AutoManual | GPT-4.1 | 2026.05 | 4,600 | 95.2 |
| AdaPlanner | GPT-4.1 | 2026.05 | 6,100 | 90.3 |
| ReAct† | GPT-4.1 | 2026.05 | 9,200 | 92.8 |
| RCI | GPT-4.1 | 2026.05 | 10,200 | 87.2 |