SOTA Long-term Planning benchmarks and papers with code | Wizwand
Long-term Planning
Benchmarks
| Dataset Name | SOTA Method | Metric | Results | Last Updated |
| --- | --- | --- | --- | --- |
| GUI-Odyssey | RL-Continuous-7B | Type Success Rate: 66.26 | 14 | 3mo ago |
| AndroidControl High | SSL-7B | Type Rate: 71.79 | 14 | 3mo ago |
| AgentBench LTP | Gemini-3-flash | Task Completion Score (TCS): 32.3 | 4 | 6d ago |