Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MDP with Tucker rank

Benchmarks

Task NameDataset NameSOTA ResultTrend
Regret MinimizationMDP with Tucker Rank (d, S, A) (theoretical)
Metric-
0
Regret MinimizationMDP with Tucker Rank (S, d, A) (theoretical)
Metric-
0
Regret MinimizationMDP with Tucker Rank (S, S, d) (theoretical)
Metric-
0
Transfer Reinforcement LearningMDP with Tucker rank (S, d, A)
Metric-
0
Showing 4 of 4 rows