Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline multitask Reinforcement Learning on Hopper backward

596Average Episodic Return

MOPO

177.92286.46395503.54Mar 10, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
596
2024.03
596
2024.03
367
2024.03
367
2024.03
269
2024.03
269
2024.03
261
2024.03
261
2024.03
220
2024.03
220
2024.03
194
2024.03
194