Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Reinforcement Learning on Pong Atari 2600
Loading...
19.8
IQM Return
Spectral Routing
19.28
19.415
19.55
19.685
Apr 15, 2026
IQM Return
Bellman Target Variance (%)
Updated 3d ago
Evaluation Results
Method
Method
Links
IQM Return
Bellman Target Variance (%)
Spectral Routing
Replay Strategy=Spectr...
2026.04
19.8
2
REM
Replay Strategy=Random...
2026.04
19.6
-
PER
Replay Strategy=Priori...
2026.04
19.5
-
Uniform Replay
Replay Strategy=Unifor...
2026.04
19.3
-
Feedback
Search any
task
Search any
task