Unsupervised Reinforcement Learning on DMC (DeepMind Control Suite) Walker
[Chart: Entropy (State-Dependent Policy) over time. Highlighted result: Soft FB_flow, 13.97, Feb 6, 2026.]
Updated 1mo ago
Evaluation Results
| Method | Settings | Date | Entropy (State-Dependent Policy) | Entropy (Marginal Policy) | Neg KL Divergence (State vs Stoch State) | Neg KL Divergence (Marginal vs Stoch) | Neg KL Divergence (State vs Det State) | Neg KL Divergence (Marginal vs Det) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Soft FB_flow | Zero-shot=true, Averag... | 2026.02 | 13.97 | 18.01 | -5.25 | -1.48 | -5.69 | -1.53 |
| FB_flow | Zero-shot=true, Averag... | 2026.02 | 12.56 | 9.86 | -5.35 | -9.4 | -5.69 | -9.43 |