
Offline Inverse Reinforcement Learning on MuJoCo hopper (medium-exp)

Average Reward: 3,512.09

Expert Performance

[Chart: y-axis ticks 2,772.754, 2,964.697, 3,156.64, 3,348.583; x-axis tick Feb 15, 2023]
Updated 1mo ago

Evaluation Results

Date       Average Reward   Method   Links
2023.02    3,512.09
2023.02    3,350.47
2023.02    3,347.11
2023.02    3,073.16
2023.02    2,801.19