Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Regret Minimization in Reinforcement Learning on Tabular MDP
Loading...
3
Cumulative Regret
Levy and Mansour (2023)
2.85
2.925
3
3.075
May 1, 2026
Cumulative Regret
Estimation Oracle Calls
Planning Oracle Calls
Updated 1mo ago
Evaluation Results
Method
Method
Links
Cumulative Regret
Estimation Oracle Calls
Planning Oracle Calls
Levy and Mansour (2023)
Estimation oracle type...
2026.05
3
-
-
Feedback
Search any
task
Search any
task