Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Reinforcement Learning on Breakout Atari 2600
Loading...
364
IQM Return
Spectral Routing
326.56
336.28
346
355.72
Apr 15, 2026
IQM Return
Bellman Target Variance Drop
Updated 3d ago
Evaluation Results
Method
Method
Links
IQM Return
Bellman Target Variance Drop
Spectral Routing
Replay Strategy=Spectr...
2026.04
364
23
REM
Replay Strategy=Random...
2026.04
346
-
PER
Replay Strategy=Priori...
2026.04
340
-
Uniform Replay
Replay Strategy=Unifor...
2026.04
328
-
Feedback
Search any
task
Search any
task