Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Unsupervised Reinforcement Learning on DMC (DeepMind Control Suite) Maze
Loading...
11.22
Entropy H(Mπ_S)
Soft FB_flow
10.8144
10.9197
11.025
11.1303
Feb 6, 2026
Entropy H(Mπ_S)
Entropy H(Mπ)
Neg KL(Mπ_S; Mπ_stoch_S)
Neg KL(Mπ; Mπ_stoch)
Neg KL(Mπ_S; Mπ_det_S)
Neg KL(Mπ; Mπ_det)
Updated 4d ago
Evaluation Results
Method
Method
Links
Entropy H(Mπ_S)
Entropy H(Mπ)
Neg KL(Mπ_S; Mπ_stoch_S)
Neg KL(Mπ; Mπ_stoch)
Neg KL(Mπ_S; Mπ_det_S)
Neg KL(Mπ; Mπ_det)
Soft FB_flow
Zero-shot=true, Averag...
2026.02
11.22
15.54
-5.25
-1.39
-5.07
-2.51
FB_flow
Zero-shot=true, Averag...
2026.02
10.83
10.52
-6.49
-6.45
-6.57
-7.27
Feedback
Search any
task
Search any
task