Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Embodied Task Planning on EB-Habitat (OOD)
Loading...
59
Success Rate
GPT-4o
18.44
28.97
39.5
50.03
Apr 9, 2026
Success Rate
Updated 9d ago
Evaluation Results
Method
Method
Links
Success Rate
GPT-4o
2026.04
59
Claude-3.7-Sonnet
2026.04
58.7
Gemini-1.5-Pro
2026.04
56.3
RoboAgent
2026.04
22.3
RoboGPT-R1
2026.04
22
REBP
2026.04
20
Feedback
Search any
task
Search any
task