Offline Reinforcement Learning on D4RL HalfCheetah Med-Expert v2

105.9 Avg Normalized Return

MBOP

[Chart: Avg Normalized Return over time, Jun 3, 2021 to Feb 13, 2022]
Updated 1mo ago

Evaluation Results

Date       Avg Normalized Return
2021.06    105.9
2021.06    95
2022.02    93.4
2021.06    91.6
2022.02    91.6
2022.02    90.7
2022.02    86.9
-          86.8
2022.02    86.8
2022.02    86.7
-          59.9
2022.02    55.2
2022.02    42.8
2021.06    41.9
2021.06    40.8