Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Online Reinforcement Learning on MinAtar (|A|=216, k=3) Macro-Action

11.98Breakout Score

DQN

5.5847.24458.90510.5655May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
11.980.670.660.5210.364.84
2026.05
8.40.780.561.29.454.08
2026.05
7.80.560.210.856.43.16
2026.05
7.450.5300.9810.773.95
2026.05
5.830.6527.90.6614.259.86