Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Reinforcement Learning on Pitfall Atari 2600
Loading...
12
IQM Return
Uniform Replay
-0.48
2.76
6
9.24
Apr 15, 2026
IQM Return
Bellman Target Variance Drop
Updated 3d ago
Evaluation Results
Method
Method
Links
IQM Return
Bellman Target Variance Drop
Uniform Replay
Replay Strategy=Unifor...
2026.04
12
-
PER
Replay Strategy=Priori...
2026.04
10
-
REM
Replay Strategy=Random...
2026.04
8
-
Spectral Routing
Replay Strategy=Spectr...
2026.04
0
11
Feedback
Search any
task
Search any
task