SOTA Regret Minimization benchmarks and papers with code | Wizwand
Regret Minimization
Benchmarks
| Dataset Name | SOTA Method | Metric | Metric Value | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| Synthetic Subspace | Oracle GP-TS | Average Total Regret | 86 | 10 | 2mo ago |
| Synthetic Lengthscale | Oracle GP-TS | Average Total Regret | 28.1 | 10 | 2mo ago |
| Synthetic Kernel | Oracle GP-TS | Average total regret | 35 | 10 | 2mo ago |
| Two dim reward function synthetic (test) | RMEL | Oracle Regret | 2,589.32 | 9 | 3mo ago |
| Sine reward function synthetic (test) | RMEL | Oracle Regret | 289.86 | 9 | 3mo ago |
| Triangle reward function synthetic (test) | Zooming | Oracle Regret | 366.58 | 9 | 3mo ago |
| Two dim reward function weak adversaries Appendix A.7 (test) | RMEL | Oracle Regret | 2,589.32 | 9 | 3mo ago |
| Sine reward function weak adversaries Appendix A.7 (test) | RMEL | Oracle Regret | 280.62 | 9 | 3mo ago |
| Triangle reward function weak adversaries Appendix A.7 (test) | Zooming | Oracle Regret/Score | 366.58 | 9 | 3mo ago |
| Episodic MNL mixture MDP | Hwang and Oh | Regret Bound | 1 | 7 | 6d ago |
| Synthetic Patient Cohort Ablation Grid | UCB-BOLD | Cumulative Normalized Regret CVaR (1-alpha=0.5) | 0.13 | 6 | 8d ago |
| PNW Precip 1980-1994 (test) | HP-GP-TS | Average Total Regret | 167.7 | 6 | 2mo ago |
| PeMS | PE-GP-TS | Average Total Regret | 1,214.2 | 6 | 2mo ago |
| Intel | EEI | Average Total Regret | 51.6 | 6 | 2mo ago |
| Multi-armed Bandit anytime setting | Naive implementation | Regret Coefficient | 2 | 5 | 3mo ago |
| Regret minimization with static historical data | KLUCB-H | Regret Upper Bound | 0 | 4 | 8d ago |
| F_LR Stochastic Low-Rank Reward | Noisy power method (NPM) | Regret | 2 | 4 | 3mo ago |
| F_EV Stochastic Eigenvalue Reward settings | Lower Bound | Regret | 2 | 4 | 3mo ago |
| Multi-armed bandits under network interference | NSE-FS (Alg. 1) | Lower Bound | 1 | 3 | 6d ago |
| Bilateral Trade Correlated Valuations | Fixed-price mechanism | Regret Bound | 1 | 3 | 14d ago |
| Bilateral Trade Independent Valuations | Fixed-price mechanism | Regret Bound | 1 | 3 | 14d ago |
| Contextual Combinatorial Bandits | Kong et al. (2023) | Stochastic Regret | 2 | 3 | 2mo ago |
| Simple World Comm | NePPO | Maximum Regret | 17.26 | 3 | 2mo ago |
| Generalized Linear Contextual Bandits | BGLE | Worst-Case Regret | 2 | 2 | 1d ago |
| Full-feedback mechanisms with Independent Values | Fixed-price mechanism | Regret Bound | 1 | 2 | 14d ago |
Showing 25 of 88 rows