
Reinforcement Learning on Hopper (Average Episode Reward)

2,743.9 Avg Episode Reward

TD3

[Chart: Avg Episode Reward over time, Nov 2, 2023 to Jan 26, 2026]
Updated 4d ago

Evaluation Results

Date     Avg Episode Reward
2023.11  2,743.9
2023.11  2,613.16
2023.11  2,593.56
2023.11  2,586.56
2023.11  2,583.88
2023.11  2,442.48
2023.11  2,122.4
2023.11  2,104.98
2023.11  1,678.84
2026.01  331.2
2026.01  305
2026.01  303.8
2026.01  180.5
2026.01  172.7
2023.11  47.35