Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-Objective Reinforcement Learning on Maze

223.55Mean Episode Reward (MER)

RANDOM

-5.14654.227113.6172.973Mar 24, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.03
223.550
2026.03
85.550.05
40.3362.92
2026.03
30.1659.04
2026.03
30.1659.04
2026.03
27.3542.94
2026.03
23.660.01
2026.03
16.151.12
2026.03
15.2861.13
2026.03
10.360.01
2026.03
3.650