Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Robot Task Planning on AI2-THOR Elemental tasks
Loading...
100
Success Rate
SMART-LLM
93.76
95.38
97
98.62
Dec 19, 2025
Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate
SMART-LLM
Backbone=Llama2-70b
2025.12
100
RecipeMasterLLM
2025.12
94
Feedback
Search any
task
Search any
task