Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-horizon tasks on Minecraft Gold
Loading...
21.69
Success Rate (SR)
EvoAgent
-0.8676
4.9887
10.845
16.7013
Feb 9, 2025
Success Rate (SR)
Efficiency Error (EE)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Efficiency Error (EE)
EvoAgent
2025.02
21.69
30.48
Optimus-1
2025.02
10.62
8.03
Jarvis-1
2025.02
8.84
9.76
LS-Imagine
2025.02
6.61
10.69
DreamerV3
2025.02
6.57
8.05
PPO
2025.02
0
0
GPT-4V
2025.02
0
0
Feedback
Search any
task
Search any
task