Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Embodied Task Completion on ALFWorld in-distribution held-out (test)
Loading...
97.7
Pick Success
Meta-RL
91.668
93.234
94.8
96.366
Dec 18, 2025
Pick Success
Look Success
Clean Success
Heat Success
Updated 4d ago
Evaluation Results
Method
Method
Links
Pick Success
Look Success
Clean Success
Heat Success
Meta-RL
Framework=LAMER, Train...
2025.12
97.7
100
90.2
89.5
RL
Framework=LAMER, Train...
2025.12
95.5
83
67.9
86.6
Prompting
Type=Prompting-based
2025.12
91.9
52.9
48.4
44.8
Feedback
Search any
task
Search any
task