Robotic Planning on EB-ALFRED (test)
[Chart: Base Score by date. Plotted point: RoboAgent, 72, May 13, 2026. Selectable metrics: Base Score, Communication Score, Complexity Score, Visual Score, Spatial Score, Long-term Dependency Score, Average Score.]
Updated 20d ago
Evaluation Results
| Method | Date | Base Score | Communication Score | Complexity Score | Visual Score | Spatial Score | Long-term Dependency Score | Average Score |
|---|---|---|---|---|---|---|---|---|
| RoboAgent | 2026.05 | 72 | 48 | 64 | 78 | 60 | 80 | 67 |
| WAP | 2026.05 | 66 | 62 | 70 | 56 | 52 | 70 | 62.7 |
| RoboGPT-R1 | 2026.05 | 62 | 56 | 64 | 50 | 50 | 50 | 55.3 |
| RoboEvolve | 2026.05 | 60 | 52 | 70 | 62 | 52 | 74 | 61.7 |
| Qwen3-VL (extension=Simulator S) | 2026.05 | 56 | 46 | 64 | 56 | 46 | 66 | 55.7 |
| REBP | 2026.05 | 54 | 42 | 46 | 28 | 38 | 6 | 35.6 |
| Qwen3-VL | 2026.05 | 28 | 20 | 26 | 32 | 20 | 26 | 25.3 |
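
The Average Score values appear to be the unweighted mean of the six category scores (Base, Communication, Complexity, Visual, Spatial, Long-term Dependency). The page does not state this, so treat it as an assumption. The sketch below recomputes the mean for each listed method and compares it with the listed average. The names `ROWS` and `check_averages` and the 0.1 tolerance are illustrative choices, not part of the benchmark.

```python
from statistics import fmean

# Rows copied from the results above: (method, six category scores, listed average).
# Score order: Base, Communication, Complexity, Visual, Spatial, Long-term Dependency.
ROWS = [
    ("RoboAgent", [72, 48, 64, 78, 60, 80], 67.0),
    ("WAP", [66, 62, 70, 56, 52, 70], 62.7),
    ("RoboGPT-R1", [62, 56, 64, 50, 50, 50], 55.3),
    ("RoboEvolve", [60, 52, 70, 62, 52, 74], 61.7),
    ("Qwen3-VL (extension=Simulator S)", [56, 46, 64, 56, 46, 66], 55.7),
    ("REBP", [54, 42, 46, 28, 38, 6], 35.6),
    ("Qwen3-VL", [28, 20, 26, 32, 20, 26], 25.3),
]


def check_averages(rows, tol=0.1):
    """Compare the unweighted mean of each row's scores with its listed average.

    Assumption: Average Score is a plain mean of the six category scores.
    Returns a list of (method, computed_mean, listed_average, within_tol).
    """
    report = []
    for method, scores, listed in rows:
        computed = fmean(scores)
        report.append((method, computed, listed, abs(computed - listed) <= tol))
    return report


if __name__ == "__main__":
    for method, computed, listed, ok in check_averages(ROWS):
        print(f"{method}: computed {computed:.2f}, listed {listed}, match={ok}")
```

A tolerance of 0.1 is used because the listed averages carry only one decimal place, so the comparison should hold whether the page rounds or truncates the last digit.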