Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Regret Minimization on KL-regularized Bandits
Loading...
2
Sample Complexity
Online Iterative GSHF
1.9
1.95
2
2.05
Feb 11, 2025
Sample Complexity
Regret
Lower Bound Match Status
Updated 1mo ago
Evaluation Results
Method
Method
Links
Sample Complexity
Regret
Lower Bound Match Status
Online Iterative GSHF
Coverage=×
2025.02
2
-
-
Two-Stage Mixed-Policy Sampling
Coverage=✓
2025.02
2
-
-
KL-UCB
Type=Upper Bound
2026.03
-
2
-
Feedback
Search any
task
Search any
task