Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reinforcement Learning on single-gate Double Integrator dynamics
Loading...
11.6
Mean Return
DMPS
-20.848
-12.424
-4
4.424
May 22, 2024
Mean Return
Return Std Dev
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Return
Return Std Dev
DMPS
2024.05
11.6
0
MPS
2024.05
11.6
0
DMPS
2024.05
11.5
0.1
MPS
2024.05
11.4
0.1
PPO-Lag
2024.05
-2
1.1
TD3
2024.05
-2.1
0.2
PPO-Lag
2024.05
-2.4
0.2
CPO
2024.05
-2.5
0.3
TD3
2024.05
-3.1
0.5
CPO
2024.05
-19.6
17
Feedback
Search any
task
Search any
task