Unsupervised Reinforcement Learning on DMC (DeepMind Control Suite) Cheetah
[Chart: Entropy H(Mπ_S). Soft FB_flow: 13.63 (Feb 6, 2026). Selectable metrics: Entropy H(Mπ_S), Entropy H(Mπ), Neg KL(Mπ_S; Mπ_stoch_S), Neg KL(Mπ; Mπ_stoch), Neg KL(Mπ_S; Mπ_det_S), Neg KL(Mπ; Mπ_det)]
Updated 4d ago
Evaluation Results
| Method | Links | Entropy H(Mπ_S) | Entropy H(Mπ) | Neg KL(Mπ_S; Mπ_stoch_S) | Neg KL(Mπ; Mπ_stoch) | Neg KL(Mπ_S; Mπ_det_S) | Neg KL(Mπ; Mπ_det) |
|---|---|---|---|---|---|---|---|
| Soft FB_flow (Zero-shot=true, Averag...) | 2026.02 | 13.63 | 17.83 | -4.56 | -1.02 | -4.38 | -1.23 |
| FB_flow (Zero-shot=true, Averag...) | 2026.02 | 11.74 | 11.68 | -5.56 | -7.25 | -5.45 | -7.55 |