Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RockSample

Benchmarks

Task NameDataset NameSOTA ResultTrend
Sequential Decision MakingRockSample (8, 4, 10, -1)
Average Reward16.7
30
Sequential Decision MakingRockSample (7, 8, 20, 0)
Average Reward28.5
30
POMDP PlanningRockSample (15, 15)
Expected Return20.53
19
Value computation in MEPOMDPsRockSample
Computation Time (s)0.0581
10
POMDP PlanningRockSample (20, 20)
Expected Return12.31
10
POMDP PlanningRockSample (25, 25)
Returns4.8
6
POMDP PlanningRockSample (11, 11)
Expected Return19.09
5
POMDP PlanningRockSample (7, 8)
Expected Return21.57
5
POMDP PlanningRockSample (4, 4)
PBVI Original Time (s)0.092
1
Showing 9 of 9 rows