Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Robot Task Completion on VLA-arena Level-1 (episodes 15-49)
Loading...
25.9
Success Rate (Extr.)
Premover
7.908
12.579
17.25
21.921
May 12, 2026
Success Rate (Extr.)
Success Rate (Distr.)
Success Rate (Safe)
Success Rate (LongH)
Success Rate (Mean)
Wall-Clock Time (All) (Extr.)
Wall-Clock Time (All) (Distr.)
Wall-Clock Time (All) (Safe)
Wall-Clock Time (All) (LongH)
Wall-Clock Time (All) (Mean)
Wall-Clock Time (Succ.) (Extr.)
Wall-Clock Time (Succ.) (Distr.)
Wall-Clock Time (Succ.) (Safe)
Wall-Clock Time (Succ.) (LongH)
Wall-Clock Time (Succ.) (Mean)
Updated 21d ago
Evaluation Results
Method
Method
Links
Success Rate (Extr.)
Success Rate (Distr.)
Success Rate (Safe)
Success Rate (LongH)
Success Rate (Mean)
Wall-Clock Time (All) (Extr.)
Wall-Clock Time (All) (Distr.)
Wall-Clock Time (All) (Safe)
Wall-Clock Time (All) (LongH)
Wall-Clock Time (All) (Mean)
Wall-Clock Time (Succ.) (Extr.)
Wall-Clock Time (Succ.) (Distr.)
Wall-Clock Time (Succ.) (Safe)
Wall-Clock Time (Succ.) (LongH)
Wall-Clock Time (Succ.) (Mean)
Premover
Setting=Premover (ours...
2026.05
25.9
41.4
35.9
0
30.9
87
59.9
62.7
148.4
76.6
43.5
25.6
23.8
-
28.7
Full-prompt
Setting=Full-prompt, P...
2026.05
25.1
39.4
41.8
0
33
99.9
67.7
68.4
162.3
85.4
49.6
32
32.6
-
36
Naive Premoving
Setting=Naive Premovin...
2026.05
8.6
32.9
41
0
27
92.3
59.8
53.5
147.9
73.8
31.9
21.8
19.5
-
21.1
Feedback
Search any
task
Search any
task