Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Flappy Task 3
Loading...
32.29
Grand Average Return
Qreg+NWLU
9.5452
15.4501
21.355
27.2599
May 21, 2026
Grand Average Return
Updated 12d ago
Evaluation Results
Method
Method
Links
Grand Average Return
Qreg+NWLU
2026.05
32.29
Qreg
2026.05
30.41
PM
2026.05
26.93
PackNet
2026.05
23.15
MER
2026.05
22.69
DQN
2026.05
22.63
DDQN
2026.05
22.17
EWC
2026.05
14.98
L2
2026.05
10.42
Feedback
Search any
task
Search any
task