Offline Reinforcement Learning on D4RL Maze maze2d-umaze v2 (test)
[Chart: Normalized Score over time. Top result: A2PO, 13,330 (Mar 12, 2024).]
Updated 4d ago
Evaluation Results
| Method | Date | Normalized Score |
|---|---|---|
| A2PO | 2024.03 | 13,330 |
| AWAC | 2024.03 | 9,450 |
| LAPO | 2024.03 | 7,800 |
| Diffusion-QL | 2024.03 | 6,670 |
| EQL | 2024.03 | 5,650 |
| IQL | 2024.03 | 5,620 |
| BCQ | 2024.03 | 2,480 |
| TD3+BC | 2024.03 | 2,420 |
| CQL+AW | 2024.03 | 1,960 |
| CQL | 2024.03 | 570 |
| BC | 2024.03 | 50 |
| MOPO | 2024.03 | -1,540 |