Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Reinforcement Learning on Pong
Loading...
40
Average Reward
Average Human
-0.8304
9.7698
20.37
30.9702
May 29, 2024
Sep 24, 2024
Jan 20, 2025
May 18, 2025
Sep 13, 2025
Jan 9, 2026
May 8, 2026
Average Reward
Updated 22d ago
Evaluation Results
Method
Method
Links
Average Reward
Average Human
2024.05
40
DDQN
2024.05
19
FDQN
2024.05
18
DQN
2024.05
17
R2PO
Number of runs=10, Agg...
2026.05
2.51
ProPS+
Number of runs=10, Agg...
2026.05
1.22
PPO
Number of runs=10, Agg...
2026.05
1.02
ProPS
Number of runs=10, Agg...
2026.05
0.74
Feedback
Search any
task
Search any
task