Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on DeepMind Control Cartpole Balance Sparse

150,700Steps to 75% Return

Dreamer V3

120,860322,280523,700725,120May 15, 2026
Updated 16d ago

Evaluation Results

MethodLinks
2026.05
150,700
2026.05
158,300
2026.05
166,300
2026.05
166,700
2026.05
170,400
2026.05
171,300
2026.05
174,400
2026.05
178,500
2026.05
345,000
2026.05
381,700
896,700