Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Inverse Reinforcement Learning on MuJoCo halfcheetah (medium-replay)

12,174.61Average Reward

Expert Performance

-32.31723,136.78896,305.8959,475.0011Feb 15, 2023
Updated 1mo ago

Evaluation Results

MethodLinks
2023.02
12,174.61
2023.02
9,236.84
2023.02
4,471.72
2023.02
1,125.17
2023.02
437.18