Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Continuous Control on BipedalWalker v3
Loading...
298.4
Episodic Cumulative Reward
MA-MPPI
-128.364
-17.5695
93.225
204.0195
Oct 11, 2023
Feb 7, 2024
Jun 5, 2024
Oct 2, 2024
Jan 29, 2025
May 28, 2025
Sep 24, 2025
Episodic Cumulative Reward
Updated 23d ago
Evaluation Results
Method
Method
Links
Episodic Cumulative Reward
MA-MPPI
2025.09
298.4
AOC-BC
sampler=Behavior Clone...
2023.10
276.98
MPPI
2025.09
241.7
MPC
2025.09
219.6
BC
training_data=offline...
2023.10
208.72
Data-Avg-Return
2023.10
202.25
iLQR
2025.09
184.2
SAC
2025.09
112.6
PPO
2025.09
96.3
DDPG
2025.09
74.8
MFRL
type=model-free RL
2023.10
18.51
AOC-Uniform
sampler=uniform sampler
2023.10
-90.44
MPC
type=model-based RL
2023.10
-96.82
KNN
variant=k-neighbors
2023.10
-109.72
1NN
variant=nearest-neighbor
2023.10
-111.95
Feedback
Search any
task
Search any
task