Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Embodied Instruction Following on Alfworld
Loading...
96.3
Progress Rate
Explicit RM
63.644
72.122
80.6
89.078
Feb 25, 2025
Progress Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Progress Rate
Explicit RM
Inference Strategy=Bea...
2025.02
96.3
Explicit RM
Inference Strategy=Bes...
2025.02
94.8
ImplicitPRM
Inference Strategy=Bes...
2025.02
94.8
QLASS
2025.02
82.8
StepAgent
2025.02
76.1
ETO
2025.02
73.4
SPIN
2025.02
71.9
Greedy Search
2025.02
71.6
NAT
2025.02
68.3
gpt-4o
2025.02
66.4
LLM-as-a-judge
Inference Strategy=Bes...
2025.02
64.9
Feedback
Search any
task
Search any
task