Share your thoughts, 1 month free Claude Pro on usSee more

Reinforcement Learning on MuJoCo w/o humanoid v4

423.4Runtime (seconds)

PDA

Updated 1mo ago

Evaluation Results

Method	Links
PDA 2026.03		423.4
TRPO 2026.03		512.3
NPG 2026.03		566.5
PPO 2026.03		677.8