Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Flappy Task 3

32.29Grand Average Return

Qreg+NWLU

9.545215.450121.35527.2599May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
32.29
2026.05
30.41
2026.05
26.93
2026.05
23.15
2026.05
22.69
2026.05
22.63
2026.05
22.17
2026.05
14.98
2026.05
10.42