
D4RL Cheetah

Benchmarks

Task Name                                 Dataset Name                     SOTA Result                   Trend
Locomotion                                D4RL Cheetah Medium              Mean Return: 5,277.5          17
Offline-to-online Reinforcement Learning  D4RL Cheetah expert discretized  Online Normalized Score: 9.7  9
Offline-to-online Reinforcement Learning  D4RL Cheetah medium discretized  Online Score: 16.9            9
Locomotion                                D4RL Cheetah Medium-Expert       Mean Return: 97.1             5
Locomotion                                D4RL Cheetah Medium-Replay       Mean Return: 90.7             5
Locomotion                                D4RL Cheetah Random              Mean Return: 77.1             5
Reinforcement Learning                    D4RL Cheetah Medium-Expert       Mean Normalized Return: 98.7  5