Robot Failure Explanation on RoboFail
Best reported result: Gemini 2.5 Pro, Coherence Score (CS) 0.628 (Jun 6, 2025)
Metrics: Coherence Score (CS), ROUGE-L Score, LLM-J Score
Updated 1mo ago
Evaluation Results
| Method | Date | Coherence Score (CS) | ROUGE-L Score | LLM-J Score |
|---|---|---|---|---|
| Gemini 2.5 Pro | 2025.06 | 0.628 | 0.342 | 0.55 |
| AHA-13B | 2025.06 | 0.471 | 0.28 | 0.465 |
| RoboFAC-7B | 2025.06 | 0.452 | 0.137 | 0.133 |