Continual Reinforcement Learning on MiniGrid

70.6 Mean SR

TELAPA

9.65625.47841.357.122 Apr 16, 2026
Updated 1mo ago

Evaluation Results

Method Links
2026.04
70.63.3530.35062.925
2026.04
58.86.5526.23769.514
57.65.9126.33370.915
52.55.495.4300.924
2026.04
50.75.5929.43870.212
2026.04
46.36.86333384.46
2026.04
38.88.0410.1921.21
2026.04
37.47.7528.33374.82
2026.04
14.39.679511.55
2026.04
129.747.28142