Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Qbert

7,000Average Reward

Average Human

4,9205,4606,0006,540May 29, 2024
Updated 3mo ago

Evaluation Results

MethodLinks
7,000
2024.05
5,200
2024.05
5,175
2024.05
5,000