Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Robot Task Planning on REI-Bench Mixed REs
Loading...
28.5
Object Omission Rate
LLaMA3.1-8B + TOCC
26.376
40.713
55.05
69.387
May 16, 2025
Object Omission Rate
Execution Error Rate
Overall Error Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Object Omission Rate
Execution Error Rate
Overall Error Rate
LLaMA3.1-8B + TOCC
Base Model=LLaMA3.1-8B...
2025.05
28.5
37.9
66.4
LLaMA3.1-8B + AP
Base Model=LLaMA3.1-8B...
2025.05
31.3
39.7
71
LLaMA3.1-8B + ICL
Base Model=LLaMA3.1-8B...
2025.05
32.7
39
71.7
LLaMA3.1-8B + CoT
Base Model=LLaMA3.1-8B...
2025.05
34.9
34.2
69.1
LLaMA3.1-8B
Base Model=LLaMA3.1-8B...
2025.05
38.8
31.1
69.9
LLaMA3.1-8B - Context
Base Model=LLaMA3.1-8B...
2025.05
81.6
5.3
86.9
Feedback
Search any
task
Search any
task