Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Environment Interaction on Complex Instruction Tasks Interaction
Loading...
1.13
NE
MapGPT
1.1056
1.2703
1.435
1.5997
Apr 9, 2025
NE
TL
Success Rate (SR)
SPL
Updated 3mo ago
Evaluation Results
Method
Method
Links
NE
TL
Success Rate (SR)
SPL
MapGPT
2025.04
1.13
13.78
8
3
BrainNav
2025.04
1.74
7.45
25
26.4
Feedback
Search any
task
Search any
task