Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Tabular MDP

Benchmarks

Task NameDataset NameSOTA ResultTrend
Unregularized Reinforcement LearningTabular MDP Finite State Action Spaces
Sample Complexity1
3
Entropy-Regularized Reinforcement LearningTabular MDP Finite State Action Spaces
Sample Complexity1
2
Regret minimization in Reinforcement LearningTabular MDP
Cumulative Regret3
1
Reinforcement LearningTabular MDP
Sample Complexity0.5
1
Showing 4 of 4 rows