Robot Task Planning on REI-Bench Implicit REs
Chart: Object Omission Rate. Top entry: LLaMA3.1-8B + TOCC, 40.1 (May 16, 2025).
Metrics: Object Omission Rate, Execution Error Rate, Overall Error Rate
Updated 1mo ago
Evaluation Results
| Method | Details | Date | Object Omission Rate | Execution Error Rate | Overall Error Rate |
|---|---|---|---|---|---|
| LLaMA3.1-8B + TOCC | Base Model=LLaMA3.1-8B... | 2025.05 | 40.1 | 30.6 | 70.7 |
| LLaMA3.1-8B + CoT | Base Model=LLaMA3.1-8B... | 2025.05 | 47.6 | 30.3 | 77.9 |
| LLaMA3.1-8B + AP | Base Model=LLaMA3.1-8B... | 2025.05 | 49.9 | 27.4 | 77.3 |
| LLaMA3.1-8B + ICL | Base Model=LLaMA3.1-8B... | 2025.05 | 49.9 | 28.7 | 78.6 |
| LLaMA3.1-8B | Base Model=LLaMA3.1-8B... | 2025.05 | 53.9 | 24 | 77.9 |
| LLaMA3.1-8B - Context | Base Model=LLaMA3.1-8B... | 2025.05 | 85.1 | 5.5 | 90.6 |
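In every row listed, the Overall Error Rate appears to equal the Object Omission Rate plus the Execution Error Rate. The page does not state this definition, so treat it as an observation from the numbers rather than the benchmark's official formula. A minimal sketch that checks the relationship and picks the lowest overall value, assuming lower is better because all three columns are error rates (the names `rows`, `overall_matches`, `mismatches`, and `best` are illustrative):

```python
# Values copied from the results rows: (omission, execution, overall).
rows = {
    "LLaMA3.1-8B + TOCC": (40.1, 30.6, 70.7),
    "LLaMA3.1-8B + CoT": (47.6, 30.3, 77.9),
    "LLaMA3.1-8B + AP": (49.9, 27.4, 77.3),
    "LLaMA3.1-8B + ICL": (49.9, 28.7, 78.6),
    "LLaMA3.1-8B": (53.9, 24.0, 77.9),
    "LLaMA3.1-8B - Context": (85.1, 5.5, 90.6),
}


def overall_matches(omission, execution, overall, tol=0.05):
    """Return True if omission + execution equals overall within a rounding tolerance."""
    return abs((omission + execution) - overall) <= tol


# Rows where the assumed sum relationship does not hold.
mismatches = [name for name, r in rows.items() if not overall_matches(*r)]

# Assumption: a lower Overall Error Rate is better.
best = min(rows, key=lambda name: rows[name][2])

print("Mismatched rows:", mismatches)
print("Lowest overall error:", best, rows[best][2])
```

Under that assumption, the ranking by Overall Error Rate differs slightly from the ranking by Object Omission Rate: for example, LLaMA3.1-8B + AP (77.3) comes ahead of LLaMA3.1-8B + CoT (77.9) on overall error, although CoT has the lower omission rate.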