
Offline Inverse Reinforcement Learning on MuJoCo halfcheetah (medium-exp)

Average Reward: 12,174.61

Expert Performance

[Chart: Average Reward by date, Feb 15, 2023]
Updated 1mo ago

Evaluation Results

Date       Average Reward
2023.02    12,174.61
2023.02    11,231.4
2023.02    4,471.72
2023.02    1,125.17
2023.02    622.79