Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Task performance (Action Score) on Robot
Loading...
63.49
Human Action Score
GOOD
43.0124
48.3287
53.645
58.9613
Aug 20, 2025
Human Action Score
LLM Action Score
Updated 26d ago
Evaluation Results
Method
Method
Links
Human Action Score
LLM Action Score
GOOD
Inference strategy=pro...
2025.08
63.49
75.93
GOOD
Inference strategy=pro...
2025.08
61.86
48.13
Full Context
Inference strategy=Ful...
2025.08
43.8
29.13
Feedback
Search any
task
Search any
task