Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-Ended Instruction Task Execution on Minecraft Open-Ended Instruction Tasks (test)
Loading...
75
Torch Success Rate
GOAP
2.2
21.1
40
58.9
Feb 27, 2025
Torch Success Rate
Rail Success Rate
Golden Shovel Success Rate
Diamond Pickaxe Success Rate
Compass Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Torch Success Rate
Rail Success Rate
Golden Shovel Success Rate
Diamond Pickaxe Success Rate
Compass Success Rate
GOAP
Planner=GPT-4V, Policy...
2025.02
75
47
13
16
17
GOAP
Planner=GLM-4V, Policy...
2025.02
71
39
11
14
13
STEVE-1
Planner=GPT-4V, Policy...
2025.02
66
10
0
0
0
STEVE-1
Planner=GLM-4V, Policy...
2025.02
60
0
0
0
0
VPT (text)
Planner=GPT-4V, Policy...
2025.02
11
0
0
0
0
VPT (text)
Planner=GLM-4V, Policy...
2025.02
5
0
0
0
0
Feedback
Search any
task
Search any
task