Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Reinforcement Learning on Montezuma's Revenge Atari 2600
Loading...
0
IQM Return
Uniform Replay
-0.001
-0.0005
0
0.0005
Apr 15, 2026
IQM Return
Bellman Target Variance Drop (%)
Updated 3d ago
Evaluation Results
Method
Method
Links
IQM Return
Bellman Target Variance Drop (%)
Uniform Replay
Replay Strategy=Unifor...
2026.04
0
-
PER
Replay Strategy=Priori...
2026.04
0
-
REM
Replay Strategy=Random...
2026.04
0
-
Spectral Routing
Replay Strategy=Spectr...
2026.04
0
1
Feedback
Search any
task
Search any
task