D4RL Hopper

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Offline Reinforcement Learning | D4RL Hopper Medium v2 | Normalized Return: 100.1 | 43 |
| Locomotion | D4RL Hopper medium-offline | Score: 40.77 | 36 |
| Offline Reinforcement Learning | D4RL Hopper (Expert) | Mean Normalized Score: 116.6 | 32 |
| Offline Reinforcement Learning | D4RL Hopper Medium-Replay | Reward: 100.7 | 32 |
| Offline Reinforcement Learning | D4RL Hopper Med-Expert | Normalized Average Return: 113 | 21 |
| Continuous Control | D4RL Hopper medium | Normalized Return: 103.6 | 19 |
| Backdoor Attack on Offline RL | D4RL Hopper v2 (offline) | ASR: 93.1 | 9 |
| Offline-to-online Reinforcement Learning | D4RL Hopper expert discretized | Online Normalised Score: 47.1 | 9 |
| Offline-to-online Reinforcement Learning | D4RL Hopper medium discretized | Online Normalized Score: 47.9 | 9 |
| Offline Reinforcement Learning | D4RL Hopper Full-Replay | Normalized Score: 112.5 | 8 |
| Offline Behavior Distillation | D4RL Hopper (medium-expert) | Normalized Return: 107.3 | 8 |
| Offline Behavior Distillation | D4RL Hopper medium | Normalized Return: 56.4 | 8 |
| Offline Reinforcement Learning | D4RL Hopper Simultaneous Adversarial Corruption | Average Score: 24.8 | 8 |
| Offline Reinforcement Learning | D4RL Hopper (Simultaneous Random Corruption) | Average Score: 28.83 | 8 |
| Offline Reinforcement Learning | Stochastic D4RL Hopper Medium MuJoCo | Mean Return: 1,014 | 8 |
| Offline Policy Evaluation | D4RL Hopper medium | RMSE: 8.5 | 7 |
| Locomotion | D4RL Hopper Random | Mean Return: 63.3 | 5 |
| Inverse Reinforcement Learning | D4RL Hopper medium-expert | Return: 3,512 | 5 |
| Offline Inverse Reinforcement Learning | D4RL Hopper Medium-Expert v2 | Cumulative Reward: 3,366.23 | 4 |
| Reinforcement Learning | D4RL Hopper short feet (medium) | Mean Return: 3,060 | 4 |
| Reinforcement Learning | D4RL Hopper broken hips (medium) | Mean Return: 2,785 | 4 |
| Reinforcement Learning | D4RL Hopper Med-Expert | D4RL Score: 1.0389 | 2 |
| Reinforcement Learning | D4RL Hopper Medium | D4RL Score: 80.86 | 2 |