Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unsupervised Reinforcement Learning on DMC (DeepMind Control Suite) Walker

13.97Entropy (State-Dependent Policy)

Soft FB_flow

12.503612.884313.26513.6457Feb 6, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
13.9718.01-5.25-1.48-5.69-1.53
2026.02
12.569.86-5.35-9.4-5.69-9.43