Deep Reinforcement Learning on Q*bert Atari 2600
[Chart: IQM Return by date. Highlighted entry: Spectral Routing, 8,160 IQM Return. Date shown on the x-axis: Apr 15, 2026. Selectable metrics: IQM Return, Bellman Variance Drop (%).]
Updated 3d ago
Evaluation Results
| Method | Tag | Date | IQM Return | Bellman Variance Drop (%) |
|---|---|---|---|---|
| Spectral Routing | Replay Strategy=Spectr... | 2026.04 | 8,160 | 35 |
| REM | Replay Strategy=Random... | 2026.04 | 7,870 | - |
| PER | Replay Strategy=Priori... | 2026.04 | 7,750 | - |
| Uniform Replay | Replay Strategy=Unifor... | 2026.04 | 7,595 | - |