Offline Reinforcement Learning on AntMaze umaze-diverse (Normalized Average Return)

84 Normalized Average Return

CQL

Updated 10d ago

Evaluation Results

Method	Date	Normalized Average Return
CQL	2026.05	84
QT	2026.05	83.7
DC	2026.05	78.5
VIPO-LEQ	2025.04	74.8
Q-ALIGN DT	2026.05	73.2
IQL-TD-MPC	2025.04	72.6
QCS	2026.05	72.3
TD3+BC	2026.05	71.4
LEQ	2025.04	71
CGDT	2026.05	71
RVS	2026.05	70.1
IQL	2026.05	62.2
DT	2026.05	51.2
MOBILE	2025.04	0