Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline Reinforcement Learning on AntMaze umaze-diverse (Normalized Average Return)
Loading...
84
Normalized Average Return
CQL
-3.36
19.32
42
64.68
Apr 16, 2025
Jun 22, 2025
Aug 29, 2025
Nov 5, 2025
Jan 11, 2026
Mar 20, 2026
May 27, 2026
Normalized Average Return
Updated 10d ago
Evaluation Results
Method
Method
Links
Normalized Average Return
CQL
2026.05
84
QT
2026.05
83.7
DC
2026.05
78.5
VIPO-LEQ
2025.04
74.8
Q-ALIGN DT
2026.05
73.2
IQL-TD-MPC
2025.04
72.6
QCS
2026.05
72.3
TD3+BC
2026.05
71.4
LEQ
2025.04
71
CGDT
2026.05
71
RVS
2026.05
70.1
IQL
2026.05
62.2
DT
2026.05
51.2
MOBILE
2025.04
0
Feedback
Search any
task
Search any
task