Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Stair arm-to-reward function

Benchmarks

Task NameDataset NameSOTA ResultTrend
Best Arm IdentificationStair arm-to-reward function K=55
Error Rate0.57
15
Best Arm IdentificationStair arm-to-reward function K=21
Error Probability0.09
15
Best Arm IdentificationStair arm-to-reward function K=15
Error Rate0.02
15
Showing 3 of 3 rows