Reinforcement Learning on Flappy Task 5

Grand Average Return (G): 9.55

Qreg+NWLU

May 21, 2026
Updated 12d ago

Evaluation Results

Date      G      Method   Links
2026.05   9.55
2026.05   8.83
2026.05   7.29
2026.05   5.15
2026.05   4.86
2026.05   4.69
2026.05   4.47
2026.05   3.3
2026.05   2.68