Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Regret Minimization benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Regret Minimization
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Synthetic Subspace
Oracle GP-TS
Average Total Regret
86
10
1mo ago
Synthetic Lengthscale
Oracle GP-TS
Average Total Regret
28.1
10
1mo ago
Synthetic Kernel
Oracle GP-TS
Average total regret
35
10
1mo ago
Two dim reward function synthetic (test)
RMEL
Oracle Regret
2,589.32
9
1mo ago
Sine reward function synthetic (test)
RMEL
Oracle Regret
289.86
9
1mo ago
Triangle reward function synthetic (test)
Zooming
Oracle Regret
366.58
9
1mo ago
Two dim reward function weak adversaries Appendix A.7 (test)
RMEL
Oracle Regret
2,589.32
9
1mo ago
Sine reward function weak adversaries Appendix A.7 (test)
RMEL
Oracle Regret
280.62
9
1mo ago
Triangle reward function weak adversaries Appendix A.7 (test)
Zooming
Oracle Regret/Score
366.58
9
1mo ago
PNW Precip 1980-1994 (test)
HP-GP-TS
Average Total Regret
167.7
6
1mo ago
PeMS
PE-GP-TS
Average Total Regret
1,214.2
6
1mo ago
Intel
EEI
Average Total Regret
51.6
6
1mo ago
Multi-armed Bandit anytime setting
Naive implementation
Regret Coefficient
2
5
1mo ago
F_LR Stochastic Low-Rank Reward
Noisy power method (NPM)
Regret
2
4
1mo ago
F_EV Stochastic Eigenvalue Reward settings
Lower Bound
Regret
2
4
1mo ago
Contextual Combinatorial Bandits
Kong et al. (2023)
Stochastic Regret
2
3
23d ago
Simple World Comm
NePPO
Maximum Regret
17.26
3
1mo ago
Bilateral Trade Nonparametric
epoch-based algorithms using truncated-mean estimation
Regret
1
2
1mo ago
KL-regularized Bandits
Online Iterative GSHF
Sample Complexity
2
2
1mo ago
K-armed bandits Exponential Family rewards
KL-UCB++
Finite-Time Regret (Minimax Ratio)
1
2
1mo ago
K-armed bandits [0, 1] rewards
MOSS
Finite-Time Regret (Minimax Ratio)
1
2
1mo ago
Homogeneous Users Bayesian Recommendation
Unknown ordinal preference (Arbitrary preference)
Regret Upper Bound
2
1
26d ago
Bilateral Trade Parametric
epoch-based algorithms using truncated-mean estimation
Regret
2
1
1mo ago
KL-regularized Bandits Data Coverage
TMPS
Regret
2
1
1mo ago
KL-regularized Bandits Preference w/ Linear Reward
Online Iterative GSHF
Regret
2
1
1mo ago
Showing 25 of 69 rows
25 / page
50 / page
100 / page
1
2
3
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs