Online Reinforcement Learning on MinAtar (|A|=216, k=3) Macro-Action
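The |A|=216 and k=3 in the title are consistent with macro-actions built as length-3 sequences over 6 primitive actions, since 6^3 = 216. The page does not say how the macro-action set is constructed, so the sketch below is only an illustration under that assumption; `NUM_PRIMITIVE_ACTIONS`, `MACRO_ACTIONS`, and `decode_macro_action` are hypothetical names.

```python
from itertools import product

# Assumption: 6 primitive actions and macro-actions defined as every
# ordered sequence of k = 3 primitives (not confirmed by the page).
NUM_PRIMITIVE_ACTIONS = 6
K = 3

# Enumerate all k-length sequences; the list index serves as the flat macro-action id.
MACRO_ACTIONS = list(product(range(NUM_PRIMITIVE_ACTIONS), repeat=K))


def decode_macro_action(index):
    """Map a flat macro-action index to its tuple of k primitive actions."""
    return MACRO_ACTIONS[index]


print(len(MACRO_ACTIONS))
```

Under this reading, an agent picks one of the 216 indices and the environment executes the decoded primitives over the next three steps.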
[Chart: Breakout Score by method. Highlighted value: DQN, 11.98 (May 12, 2026). Available metrics: Breakout Score, Asterix Score, Freeway Score, Seaquest Score, Space Invaders Score, Average Score.]
Updated 21d ago
Evaluation Results

| Method | Method details | Links | Breakout Score | Asterix Score | Freeway Score | Seaquest Score | Space Invaders Score | Average Score |
|---|---|---|---|---|---|---|---|---|
| DQN | Number of seeds=5 | 2026.05 | 11.98 | 0.67 | 0.66 | 0.52 | 10.36 | 4.84 |
| DRIFT | Candidate budget=40, C... | 2026.05 | 8.4 | 0.78 | 0.56 | 1.2 | 9.45 | 4.08 |
| DRIFT (full) | Candidate budget=full,... | 2026.05 | 7.8 | 0.56 | 0.21 | 0.85 | 6.4 | 3.16 |
| DQN-Sub | Number of seeds=5 | 2026.05 | 7.45 | 0.53 | 0 | 0.98 | 10.77 | 3.95 |
| PPO | Number of seeds=5 | 2026.05 | 5.83 | 0.65 | 27.9 | 0.66 | 14.25 | 9.86 |
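The Average Score values appear to be the unweighted mean of the five per-game scores. The page does not state the formula, so the sketch below simply recomputes that mean from the listed numbers and compares it with the reported average to within rounding; `SCORES`, `REPORTED_AVERAGE`, and `mean_score` are illustrative names.

```python
# Per-game scores as listed: Breakout, Asterix, Freeway, Seaquest, Space Invaders.
SCORES = {
    "DQN": [11.98, 0.67, 0.66, 0.52, 10.36],
    "DRIFT": [8.4, 0.78, 0.56, 1.2, 9.45],
    "DRIFT (full)": [7.8, 0.56, 0.21, 0.85, 6.4],
    "DQN-Sub": [7.45, 0.53, 0, 0.98, 10.77],
    "PPO": [5.83, 0.65, 27.9, 0.66, 14.25],
}

# Average Score values as listed on the page.
REPORTED_AVERAGE = {
    "DQN": 4.84,
    "DRIFT": 4.08,
    "DRIFT (full)": 3.16,
    "DQN-Sub": 3.95,
    "PPO": 9.86,
}


def mean_score(scores):
    """Unweighted arithmetic mean over games (assumed aggregation)."""
    return sum(scores) / len(scores)


for method, scores in SCORES.items():
    recomputed = mean_score(scores)
    # Allow for rounding to two decimals in the reported value.
    matches = abs(recomputed - REPORTED_AVERAGE[method]) < 0.005
    print(f"{method}: recomputed={recomputed:.3f} matches_reported={matches}")
```

Because the mean is taken over raw, unnormalized scores, a single large value dominates it: PPO's 27.9 on Freeway accounts for most of its 9.86 average even though it trails DQN on Breakout.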