Regret minimization in Reinforcement Learning on Linear MDP
[Chart: Regret over time. Series: DOERL, with a data point dated May 1, 2026 (y-axis ticks 3.8 to 4.1). Available metrics: Regret, Estimation Oracle Calls, Planning Oracle Calls.]
Updated 1mo ago
Evaluation Results
Method: DOERL (Estimation oracle type...)
Date: 2026.05
Regret: 4
Estimation Oracle Calls: -
Planning Oracle Calls: -