Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Stair arm-to-reward function

Benchmarks

Task NameDataset NameSOTA ResultTrend
Best Arm IdentificationStair arm-to-reward function K=55
Error Rate0.57
15
Best Arm IdentificationStair arm-to-reward function K=21
Error Probability0.09
15
Best Arm IdentificationStair arm-to-reward function K=15
Error Rate0.02
15
Showing 3 of 3 rows