D4RL Hopper

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Offline Reinforcement Learning | D4RL Hopper Medium v2 | Normalized Return: 100.1 | 43 |
| Locomotion | D4RL Hopper medium-offline | Score: 40.77 | 36 |
| Offline Reinforcement Learning | D4RL Hopper Medium-Replay | Reward: 100.7 | 30 |
| Offline Reinforcement Learning | D4RL Hopper Med-Expert | Normalized Average Return: 113 | 21 |
| Continuous Control | D4RL Hopper medium | Normalized Return: 103.6 | 19 |
| Offline Reinforcement Learning | D4RL Hopper (Expert) | Mean Normalized Score: 113.2 | 16 |
| Offline Behavior Distillation | D4RL Hopper (medium-expert) | Normalized Return: 107.3 | 8 |
| Offline Behavior Distillation | D4RL Hopper medium | Normalized Return: 56.4 | 8 |
| Offline Reinforcement Learning | D4RL Hopper Simultaneous Adversarial Corruption | Average Score: 24.8 | 8 |
| Offline Reinforcement Learning | D4RL Hopper (Simultaneous Random Corruption) | Average Score: 28.83 | 8 |
| Offline Reinforcement Learning | Stochastic D4RL Hopper Medium MuJoCo | Mean Return: 1,014 | 8 |
| Offline Policy Evaluation | D4RL Hopper medium | RMSE: 8.5 | 7 |
| Offline Inverse Reinforcement Learning | D4RL Hopper Medium-Expert v2 | Cumulative Reward: 3,366.23 | 4 |
| Reinforcement Learning | D4RL Hopper short feet (medium) | Mean Return: 3,060 | 4 |
| Reinforcement Learning | D4RL Hopper broken hips (medium) | Mean Return: 2,785 | 4 |
| Reinforcement Learning | D4RL Hopper Med-Expert | D4RL Score: 1.0389 | 2 |
| Reinforcement Learning | D4RL Hopper Medium | D4RL Score: 80.86 | 2 |