Continuous Control on DM Control reacher-hard
[Chart: Average Reward over time, Sep 24, 2025 to Mar 5, 2026. Top result: MEOW, 0.9705.]
Updated 26d ago
Evaluation Results
| Method | Settings | Date | Average Reward |
|---|---|---|---|
| MEOW | Training timesteps=1M,... | 2025.09 | 0.9705 |
| SAC | Training timesteps=1M,... | 2025.09 | 0.9637 |
| FQL | Training timesteps=1M,... | 2025.09 | 0.9624 |
| TD3 | Training timesteps=1M,... | 2025.09 | 0.9497 |
| RC-SAC | training steps=50 [k],... | 2026.03 | 0.91 |
| SAC | training steps=50 [k],... | 2026.03 | 0.9 |
| RC-TD3 | training steps=50 [k],... | 2026.03 | 0.82 |
| TD3 | training steps=50 [k],... | 2026.03 | 0.76 |
| DDPG | Training timesteps=1M,... | 2025.09 | 0.758 |