D4RL MuJoCo

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Offline Reinforcement Learning | D4RL MuJoCo Hopper-mr v2 (medium-replay) | Avg Normalized Score: 104.4 | 36
Offline Reinforcement Learning | D4RL MuJoCo Walker2d medium-expert v2 | Average Normalized Score: 117.2 | 31
Offline Reinforcement Learning | D4RL MuJoCo Hopper-m v2 (medium) | Avg Normalized Score: 107.4 | 31
Offline Reinforcement Learning | D4RL MuJoCo Walker2d-mr v2 (medium-replay) | Average Normalized Score: 95.6 | 29
Offline Reinforcement Learning | D4RL MuJoCo Halfcheetah-mr v2 (medium-replay) | Avg Normalized Score: 72.1 | 24
Offline Reinforcement Learning | D4RL Mujoco Hopper-Medium-Expert v2 | Normalized Score: 111.9 | 22
Offline Reinforcement Learning | D4RL Mujoco Halfcheetah-Medium-Expert v2 | Normalized Score: 94.3 | 17
Offline Reinforcement Learning | D4RL MuJoCo Walker2d-e v2 (expert) | Normalized Score: 110.2 | 14
Offline Reinforcement Learning | D4RL MuJoCo Walker2d-m medium v2 | Average Normalized Score: 86.8 | 14
Offline Reinforcement Learning | D4RL MuJoCo Halfcheetah-m v2 (medium) | Average Normalized Score: 48.3 | 14
Offline Reinforcement Learning | D4RL MuJoCo Hopper-e v2 (expert) | Average Normalized Score: 110 | 14
Offline Reinforcement Learning | D4RL MuJoCo hopper-medium-expert | Normalized Score: 111.2 | 13
Offline Reinforcement Learning | D4RL MuJoCo Locomotion Domain v2 | Return (HalfCheetah, M-E): 96 | 10
Locomotion | D4RL MuJoCo hopper-medium-replay v2 | Normalized Score: 98.94 | 10
Offline Reinforcement Learning | D4RL MuJoCo v2 (test) | HalfCheetah-Medium Score: 77.6 | 10
Offline Reinforcement Learning | D4RL MuJoCo walker2d-medium-expert v0 | Normalized Score: 108.4 | 8
Offline Reinforcement Learning | D4RL MuJoCo hopper-medium-expert v0 | Normalized Avg Score: 112.7 | 8
Offline Reinforcement Learning | D4RL MuJoCo hopper-medium v0 | Avg Score (Normalized): 100.4 | 8
Offline Reinforcement Learning | D4RL MuJoCo hopper-random v0 | Normalized Score: 11.9 | 8
Offline Reinforcement Learning | D4RL MuJoCo halfcheetah-medium-replay v2 | Normalized Score: 70.7 | 7
Cross-domain Offline Imitation Learning from Demonstrations (C-off-LfD) | D4RL MuJoCo reward-free v2 (medium, medium-replay, medium-expert) | Hopper-v2 Return (medium): 58.4 | 7
Single-domain Offline Imitation Learning from Demonstrations (S-off-LfD) | D4RL MuJoCo reward-free v2 (medium, medium-replay, medium-expert) | Hopper-v2 (m) Score: 110.4 | 7
halfcheetah-medium-expert | D4RL MuJoCo (medium-expert) | Normalized Return: 86.5 | 4
walker2d-medium-expert | D4RL MuJoCo (medium-expert) | Normalized Return: 110.3 | 4
walker2d-medium | D4RL MuJoCo (medium) | Normalized Return: 72.7 | 4
Showing 25 of 30 rows