Imitation Learning on Pendulum
Best Mean Score: -179.6, Expert (TRPO), Oct 30, 2017. Updated 4d ago.
Evaluation Results
Method                     Date      Mean Score
Expert (TRPO)              2017.10   -179.6
AIRL (state-only=false)    2017.10   -204.7
AIRL (state-only=true)     2017.10   -221.5
GAIL                       2017.10   -226
GAN-GCL                    2017.10   -261.5
Random                     2017.10   -654.5