Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on D4RL Maze maze2d-large v2 (test)

156.4Normalized Score

A2PO

-6.77635.58777.95120.313Mar 12, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
156.4
2024.03
128.5
116.3
2024.03
69.7
2024.03
57
2024.03
45.7
2024.03
43.9
2024.03
43
2024.03
12.5
2024.03
10.3
2024.03
1.1
2024.03
-0.5