Reinforcement Learning on LunarLander v3 (Average Agent Reward)
[Chart: Average Agent Reward by date. Best result: POEM, 242.1 (Jan 21, 2026).]
Updated 4d ago
Evaluation Results
Method    Evaluation Episodes    Date       Average Agent Reward
POEM      15                     2026.01    242.1
PPO       15                     2026.01    210.94