Reinforcement Learning on Flappy Task 1

Grand Average Return (G): 94.53

Qreg+NWLU

[Plot: Grand Average Return (G); y-axis ticks 60.3036 to 86.9607; date shown: May 21, 2026]
Updated 12d ago

Evaluation Results

Date     G      Method  Links
2026.05  94.53
2026.05  88.82
2026.05  81.87
2026.05  81.36
2026.05  79.07
2026.05  73.46
2026.05  71.14
2026.05  69.39
2026.05  61.62