Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Reinforcement Learning on Seaquest Atari 2600
Loading...
1,642
IQM Return
Spectral Routing
1,482.88
1,524.19
1,565.5
1,606.81
Apr 15, 2026
IQM Return
Bellman Target Variance Drop
Updated 3d ago
Evaluation Results
Method
Method
Links
IQM Return
Bellman Target Variance Drop
Spectral Routing
Replay Strategy=Spectr...
2026.04
1,642
42
REM
Replay Strategy=Random...
2026.04
1,550
-
PER
Replay Strategy=Priori...
2026.04
1,525
-
Uniform Replay
Replay Strategy=Unifor...
2026.04
1,489
-
Feedback
Search any
task
Search any
task