Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Subgoal Planning on LoTa-WAH OOD
Loading...
37.4
SSR
GPT-4
2.352
11.451
20.55
29.649
Apr 9, 2026
SSR
Updated 9d ago
Evaluation Results
Method
Method
Links
SSR
GPT-4
2026.04
37.4
GPT-3.5-turbo
2026.04
36
RoboAgent
2026.04
22.1
LLaMA-30B
Parameters=30B
2026.04
10.4
LLaMA-7B
Parameters=7B
2026.04
3.7
Feedback
Search any
task
Search any
task