Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Imitation Learning on Pendulum

-179.6Mean Score

Expert (TRPO)

-673.496-545.273-417.05-288.827Oct 30, 2017
Updated 4d ago

Evaluation Results

MethodLinks
-179.6
2017.10
-204.7
2017.10
-221.5
2017.10
-226
2017.10
-261.5
2017.10
-654.5