Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Unsupervised Reinforcement Learning on DMC (DeepMind Control Suite) Walker
Loading...
13.97
Entropy (State-Dependent Policy)
Soft FB_flow
12.5036
12.8843
13.265
13.6457
Feb 6, 2026
Entropy (State-Dependent Policy)
Entropy (Marginal Policy)
Neg KL Divergence (State vs Stoch State)
Neg KL Divergence (Marginal vs Stoch)
Neg KL Divergence (State vs Det State)
Neg KL Divergence (Marginal vs Det)
Updated 4d ago
Evaluation Results
Method
Method
Links
Entropy (State-Dependent Policy)
Entropy (Marginal Policy)
Neg KL Divergence (State vs Stoch State)
Neg KL Divergence (Marginal vs Stoch)
Neg KL Divergence (State vs Det State)
Neg KL Divergence (Marginal vs Det)
Soft FB_flow
Zero-shot=true, Averag...
2026.02
13.97
18.01
-5.25
-1.48
-5.69
-1.53
FB_flow
Zero-shot=true, Averag...
2026.02
12.56
9.86
-5.35
-9.4
-5.69
-9.43
Feedback
Search any
task
Search any
task