Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

D4RL Walker

Benchmarks

Task NameDataset NameSOTA ResultTrend
Offline Reinforcement LearningD4RL Walker Medium
Reward77.84
10
Offline-to-online Reinforcement LearningD4RL Walker expert discretized
Online Normalized Score14.8
9
Offline-to-online Reinforcement LearningD4RL Walker medium discretized
Online Normalised Score15.9
9
LocomotionD4RL Walker Random
Mean Return50.4
5
Reinforcement LearningD4RL Walker Medium-Expert
Mean Normalized Return100
5
Reinforcement LearningD4RL Walker Random
Mean Normalized Return47.1
5
Reinforcement LearningD4RL Walker no right thigh (medium)
Mean Return3,293
4
Reinforcement LearningD4RL Walker broken right thigh (medium)
Mean Return3,743
4
Reinforcement LearningD4RL Walker Med-Expert
D4RL Score110.51
2
Reinforcement LearningD4RL Walker Med-Replay
D4RL Score72.36
2
Showing 10 of 10 rows