Embodied Planning on ALFWorld OOD v1
[Chart: Success Rate over time. Best entry: GFlowVLM w/ SubTB, Success Rate 12.3, Mar 9, 2025.]
Evaluation Results
Method               Configuration               Date      Success Rate
GFlowVLM w/ SubTB    Assump.=NM, SFT Initia...   2025.03   12.3
GFlowVLM w/ Var-TB   Assump.=NM, SFT Initia...   2025.03   10.9
RL4VLM               Assump.=NM, SFT Initia...   2025.03   6.1
RL4VLM               Assump.=M, SFT Initial...   2025.03   4.8
SFT-w/o-             Assump.=-                   2025.03   3.3
SFT-w/-              Assump.=-                   2025.03   3