Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

D4RL Walker2d

Benchmarks

Task NameDataset NameSOTA ResultTrend
Offline Reinforcement LearningD4RL Walker2d Medium v2
Normalized Return94.2
67
LocomotionD4RL Walker2d medium-offline
Normalized Score37.45
36
Offline Reinforcement LearningD4RL walker2d medium-replay v2
Normalized Score100.6
36
Offline Reinforcement LearningD4RL Walker2D Expert
Mean Normalized Score117.4
30
Locomotion ControlD4RL Walker2d medium-expert
Normalized Return111.2
23
Offline Reinforcement LearningD4RL Walker2d Medium
Normalized Avg Return87.7
18
Continuous ControlD4RL Walker2d medium
Normalized Return81.9
14
Offline Reinforcement LearningD4RL Walker2d Med-Replay
Normalized Average Return82.6
11
Offline Behavior DistillationD4RL Walker2D (medium-expert)
Normalized Return109
8
Offline Behavior DistillationD4RL Walker2D medium
Normalized Return84
8
Offline Reinforcement LearningD4RL Walker2d Simultaneous Random Corruption
Average Score23.62
8
Offline Reinforcement LearningD4RL Walker2d Stochastic MuJoCo (Mixed)
Mean Return450
8
Offline Policy EvaluationD4RL Walker2d medium
RMSE149
7
Offline Reinforcement LearningD4RL Walker2d random v0
Return412
6
Continuous ControlD4RL Walker2d expert
Normalized Return125.7
5
Offline Inverse Reinforcement LearningD4RL Walker2d Medium-Expert v2
Cumulative Reward4,049.43
4
Offline Inverse Reinforcement LearningD4RL Walker2d Medium v2
Cumulative Reward4,121.68
4
LocomotionD4RL Walker2d-expert
Normalized Score (100k Steps)120.35
3
LocomotionD4RL walker2d medium v2
Normalized Return102.9
2
Off-policy EvaluationD4RL Walker2D-medium-expert
RMAE0.252
2
Showing 20 of 20 rows