Share your thoughts, 1 month free Claude Pro on usSee more

Reinforcement Learning on MuJoCo Humanoid

10,249Average Return

SPMD

Updated 4mo ago

Evaluation Results

Method	Links
SPMD 2023.05		10,249
SAC 2023.05		6,923
SiMPO-Lin. Neg. 2026.03		5,466
SiMPO-Linear 2026.03		5,376
SAC 2026.03		5,298
TD3 2026.03		5,263
DIPO 2026.03		5,184
SiMPO-Exp 2026.03		5,100
SiMPO-Square 2026.03		5,068
DACER 2026.03		3,142
QSM 2026.03		2,308
QVPO 2026.03		1,375