Share your thoughts, 1 month free Claude Pro on usSee more

Regret Minimization

Benchmarks

Dataset Name	SOTA Method	Metric
Synthetic Subspace		Average Total Regret86	10	4mo ago
Synthetic Lengthscale		Average Total Regret28.1	10	4mo ago
Synthetic Kernel		Average total regret35	10	4mo ago
Discrete feature values setting n=5000 (simulation)	SUPER	Mean Regret0.17	9	1mo ago
Two dim reward function synthetic (test)	RMEL	Oracle Regret2,589.32	9	5mo ago
Sine reward function synthetic (test)	RMEL	Oracle Regret289.86	9	5mo ago
Triangle reward function synthetic (test)	Zooming	Oracle Regret366.58	9	5mo ago
Two dim reward function weak adversaries Appendix A.7 (test)	RMEL	Oracle Regret2,589.32	9	5mo ago
Sine reward function weak adversaries Appendix A.7 (test)	RMEL	Oracle Regret280.62	9	5mo ago
Triangle reward function weak adversaries Appendix A.7 (test)	Zooming	Oracle Regret/Score366.58	9	5mo ago
Episodic MNL mixture MDP	Hwang and Oh	Regret Bound1	7	1mo ago
Episodic Tabular MDPs Adversarial Regime	Dann et al. (2023, Theorem 4.3)	Regret Upper Bound2	6	25d ago
Combinatorial Bandits Theoretical	[10]	Regret0.5	6	1mo ago
Synthetic Patient Cohort Ablation Grid	UCB-BOLD	Cumulative Normalized Regret CVaR (1-alpha=0.5)0.13	6	2mo ago
PNW Precip 1980-1994 (test)	HP-GP-TS	Average Total Regret167.7	6	4mo ago
PeMS	PE-GP-TS	Average Total Regret1,214.2	6	4mo ago
Intel		Average Total Regret51.6	6	4mo ago
Episodic Tabular MDPs Stochastic Regime with Adversarial Corruption	Dann et al. (2023, Theorem 4.3)	Regret Upper Bound2	5	25d ago
Multi-armed Bandit anytime setting		Regret Coefficient2	5	5mo ago
Uniform Auction Non-Stationary Adversary, K=5, T=5000	A3M	Final Regret0.0029	4	26d ago
Regret minimization with static historical data	KLUCB-H	Regret Upper Bound0	4	2mo ago
K-armed bandits Gaussian rewards		Regret114.4	4	19d ago
F_LR Stochastic Low-Rank Reward	Noisy power method (NPM)	Regret2	4	4mo ago
F_EV Stochastic Eigenvalue Reward settings	Lower Bound	Regret2	4	4mo ago
Matching Bandits Theoretical Bound		Regret2	3	26d ago

Showing 25 of 109 rows