Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Flappy Task 4

18.01Grand Average Return

Qreg+NWLU

4.94768.338811.7315.1212May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
18.01
2026.05
17.69
2026.05
14.29
2026.05
11.75
2026.05
11.12
2026.05
10.99
2026.05
10.41
2026.05
7.22
2026.05
5.45