
Imitation Learning on Pendulum

Mean Score: -179.6

Expert (TRPO)

[Chart: Mean Score over time. Y-axis ticks: -673.496, -545.273, -417.05, -288.827. X-axis label: Oct 30, 2017]
Updated 1mo ago

Evaluation Results

Mean Score    Date
-179.6        2017.10
-204.7        2017.10
-221.5        2017.10
-226          2017.10
-261.5        2017.10
-654.5