Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Flappy Task 2

54.31Grand Average Return (G)

Qreg+NWLU

25.08632.67340.2647.847May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
54.31
2026.05
49.86
2026.05
48.47
2026.05
45
2026.05
43.07
2026.05
42.83
2026.05
42.73
2026.05
35.73
2026.05
26.21