Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Ant

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningAnt v5
Average Return6,633.8
49
Continuous Robot ControlAnt v3 (test)
Reward5,648
48
LocomotionAnt IID (test)
Mean Episode Reward2,240
24
Locomotion ControlAnt sigma 0.5 (test)
Episode Reward974
24
Locomotion ControlAnt sigma 0.3 (test)
Episode Reward1,723
24
Locomotion ControlAnt sigma 0.1 (test)
Episode Reward2,240
24
Locomotion ControlAnt sigma 0.7 (test)
Episode Reward306
18
Offline Reinforcement LearningAnt kinematic shifts
Score120
16
Offline Reinforcement LearningAnt Medium D4RL
Normalized Score96.4
14
Offline Policy Adaptationant medium-expert
Normalized Score79.3
14
Offline Policy Adaptationant medium-replay
Normalized Score76.2
14
Offline Policy Adaptationant medium
Normalized Score77.2
14
Continuous ControlAnt v5
Normalized Mean Return1.14
12
Reinforcement LearningAnt fixed linear adversary
Average Performance8,069
12
Worst-case time-constrained reinforcement learningAnt MuJoCo (test)
Normalized Worst-Case Reward1.66
12
Robust Reinforcement LearningAnt MuJoCo (fixed exponential adversary)
Average Performance7,724
12
Continuous ControlAnt MuJoCo (test)
Worst-case Performance7,534
12
Robot LocomotionAnt v1 (test)
Performance Score2,370.93
12
Imitation LearningAnt one-shot v2
Normalized Score29.7
11
Imitation Learning from ObservationAnt v4
AER5,904.2506
8
Offline Reinforcement LearningAnt expert
Normalized Score23.1
7
Offline Reinforcement LearningAnt random
Normalized Score20.3
7
Continuous ControlAnt v3
Average Return5,115
7
Offline Policy Adaptationant-medium morphology shift target: expert D4RL
Normalized Avg Score74.1
7
Offline Policy Adaptationant medium gravity shift target D4RL
Average Score45.1
7
Showing 25 of 69 rows