Share your thoughts, 1 month free Claude Pro on usSee more

Offline Reinforcement Learning on D4RL Gym MuJoCo halfcheetah-medium-replay v2

77.2Normalized Average Return

VIPO-MOBILE

Updated 25d ago

Evaluation Results

Method	Links
VIPO-MOBILE 2025.04		77.2
VIPO-MOPO 2025.04		73
MOPO* 2025.04		72.1
MOBILE 2025.04		71.7
RAMBO 2025.04		68.7
COMBO 2025.04		55.1
DQL 2025.04		47.8
CQL 2025.04		45.3
IQL 2025.04		44.2
TD3+BC 2025.04		43.4
MOReL 2025.04		40.2