Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MuJoCo Ant

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningMuJoCo Ant v2
Average Return10,133
18
Reinforcement LearningMuJoCo Ant v5
Mean Episodic Return5,953
17
Continuous ControlMuJoCo Ant fixed random adversary L=0.1
Average Performance8,041
12
Reinforcement LearningMuJoCo Ant (test)
Average Reward7,586
12
Continuous ControlMuJoCo Ant logarithmic adversary v1
Avg Performance8,061
12
Reinforcement LearningMuJoCo Ant 1.5 density v1 (test)
Reward5,193
7
Offline Meta Reinforcement LearningMuJoCo Ant-dir In-distribution
Average Return863.1
6
Continuous ControlMuJoCo Ant 10-p v4
Normalized Return92.7
6
Continuous ControlMuJoCo Ant 2-p v4
Normalized Return146.1
6
Continuous ControlMuJoCo Ant v2 (train)
Mean Return4,796
6
Reinforcement LearningMuJoCo Ant epsilon=0.15 (test)
Natural Return5,381
5
Continuous ControlMuJoCo Ant 1M steps v3
Average Return5,930
5
Continuous ControlMuJoCo Ant v3 (500K steps)
Average Return5,009
5
Continuous ControlMuJoCo Ant v5 (test)
Average Return5,867
4
Off-dynamics Reinforcement LearningMuJoCo Ant 0.5 density dynamics shift (test)
Reward3,798
4
Inverse Reinforcement LearningMuJoCo Ant (test)
Average Performance5,783
4
Meta-Reinforcement LearningMuJoCo Ant Body variation (test)
CVaR 0.05 Return1,368
2
Meta-Reinforcement LearningMuJoCo Ant Mass variation (test)
CVaR 0.05 Return1,385
2
Meta-Reinforcement LearningMuJoCo Ant Goal variation (test)
CVaR 0.05 Return-454
2
Continuous control locomotionMuJoCo Ant v3 (train)
Avg Performance (1M Steps)762
2
Showing 20 of 20 rows