Online Clustering of Bandits

About

We introduce a novel algorithmic approach to content recommendation based on adaptive clustering of exploration-exploitation ("bandit") strategies. We provide a sharp regret analysis of this algorithm in a standard stochastic noise setting, demonstrate its scalability properties, and prove its effectiveness on a number of artificial and real-world datasets. Our experiments show a significant increase in prediction performance over state-of-the-art methods for bandit problems.
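The abstract does not spell out the algorithm, so the following is only a minimal sketch of one way adaptive clustering of per-user bandit estimates could work, not the authors' implementation. It assumes each user has a current estimate vector, starts from a complete graph over users, removes an edge when two users' estimates differ by more than the sum of their confidence widths, and treats the connected components that remain as clusters. The names `confidence_width`, `prune_edges`, `clusters`, the parameter `alpha`, and the toy data are all hypothetical.

```python
import math
from itertools import combinations


def confidence_width(n_obs, alpha=1.0):
    """Hypothetical confidence width that shrinks as a user is observed more."""
    return alpha * math.sqrt((1.0 + math.log(1.0 + n_obs)) / (1.0 + n_obs))


def prune_edges(edges, estimates, counts, alpha=1.0):
    """Keep edge (i, j) only if the two users' estimates are still close."""
    kept = set()
    for i, j in edges:
        gap = math.dist(estimates[i], estimates[j])
        limit = confidence_width(counts[i], alpha) + confidence_width(counts[j], alpha)
        if gap <= limit:
            kept.add((i, j))
    return kept


def clusters(n_users, edges):
    """Connected components of the user graph, found by depth-first search."""
    neighbours = {u: set() for u in range(n_users)}
    for i, j in edges:
        neighbours[i].add(j)
        neighbours[j].add(i)
    seen, components = set(), []
    for start in range(n_users):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            u = stack.pop()
            if u in component:
                continue
            component.add(u)
            stack.extend(neighbours[u] - component)
        seen |= component
        components.append(sorted(component))
    return components


# Toy data (assumed): four users whose estimates fall into two groups.
estimates = [(1.0, 0.0), (0.9, 0.1), (0.0, 1.0), (0.1, 0.9)]
counts = [200, 200, 200, 200]

# Start from a complete graph, then prune edges between dissimilar users.
edges = set(combinations(range(4), 2))
edges = prune_edges(edges, estimates, counts, alpha=1.0)
print(clusters(4, edges))  # [[0, 1], [2, 3]]
```

In a full recommender, each cluster would then share one aggregated exploration-exploitation model, and pruning would be repeated as new feedback arrives; both steps are left out of this sketch.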

Claudio Gentile, Shuai Li, Giovanni Zappella • 2014

Related benchmarks

Task | Dataset | Result | Rank
Multi-Robot Task Allocation | Canonical Masked Harness | Unseen Skill Rho (0.25, Masked): 0.26 | 3
