Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on MuJoCo w/o humanoid v4
Loading...
423.4
Runtime (seconds)
PDA
413.224
481.912
550.6
619.288
Mar 10, 2026
Runtime (seconds)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Runtime (seconds)
PDA
Hardware=Intel i7-1470...
2026.03
423.4
TRPO
Hardware=Intel i7-1470...
2026.03
512.3
NPG
Hardware=Intel i7-1470...
2026.03
566.5
PPO
Hardware=Intel i7-1470...
2026.03
677.8
Feedback
Search any
task
Search any
task