Our new X account is live! Follow @wizwand_team for updates

Reinforcement Learning on single-gate Double Integrator dynamics

11.6Mean Return

DMPS

Updated 4d ago

Evaluation Results

Method	Links
DMPS 2024.05		11.6	0
MPS 2024.05		11.6	0
DMPS 2024.05		11.5	0.1
MPS 2024.05		11.4	0.1
PPO-Lag 2024.05		-2	1.1
TD3 2024.05		-2.1	0.2
PPO-Lag 2024.05		-2.4	0.2
CPO 2024.05		-2.5	0.3
TD3 2024.05		-3.1	0.5
CPO 2024.05		-19.6	17