Reinforcement Learning on Gym-MuJoCo Walker2D

Best Average Return: 4,909 (SiMPO-Linear)

Updated 4mo ago

Evaluation Results

Method	Date	Average Return
SiMPO-Linear	2026.03	4,909
SiMPO-Lin. Neg.	2026.03	4,906
SAC	2026.03	4,625
SiMPO-Exp	2026.03	4,616
SiMPO-Square	2026.03	4,478
QSM	2026.03	3,933
DIPO	2026.03	3,809
TD3	2026.03	3,732
QVPO	2026.03	2,866
DACER	2026.03	1,871