Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Robot Task Planning on AI2-THOR Elemental tasks
Loading...
100
Success Rate
SMART-LLM
93.76
95.38
97
98.62
Dec 19, 2025
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
SMART-LLM
Backbone=Llama2-70b
2025.12
100
RecipeMasterLLM
2025.12
94
Feedback
Search any
task
Search any
task