Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LunarLander

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reinforcement LearningLunarLander v2
Final Return2,292
23
Reinforcement LearningLunarLander
Average Episode Reward283.56
10
Continuous ControlLunarLander Nonmarkov v2 (test)
AUC@T107.6
9
Reinforcement LearningLunarLander classical control 1M steps
Return267.19
8
Trajectory RankingLunarLander v2
Average Reward207.13
6
Reinforcement LearningLunarLander
Environment Episodes400,000
3
Meta-Reinforcement LearningLunarlander g
FLOPs (k)0.015
3
Reinforcement LearningLunarLander v3
Average Agent Reward242.1
2
Reinforcement Learninglunarlander Sticky
AUC@T36,783,880.67
2
Reinforcement Learninglunarlander Noisy
AUC @ T-25,766,227.01
2
Reinforcement Learninglunarlander Clean
AUC@T42,642,395.61
2
Reinforcement LearningLunarLander standard (test)
Episode Length16.5
2
Interpretability EvaluationLunarLander
Interpretability Score4
2
Stochastic Lipschitz OptimizationLunarLander
Regret7
1
Meta-Reinforcement LearningLunarLander
Metric-
0
Continuous ControlLunarLander Nonmarkov v2
AUC@T-
0
Showing 16 of 16 rows