Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline Reinforcement Learning on D4RL Gym MuJoCo halfcheetah-medium-replay v2
Loading...
77.2
Normalized Average Return
VIPO-MOBILE
38.72
48.71
58.7
68.69
Apr 16, 2025
Normalized Average Return
Updated 25d ago
Evaluation Results
Method
Method
Links
Normalized Average Return
VIPO-MOBILE
2025.04
77.2
VIPO-MOPO
2025.04
73
MOPO*
Retrained on v2=true
2025.04
72.1
MOBILE
2025.04
71.7
RAMBO
2025.04
68.7
COMBO
2025.04
55.1
DQL
2025.04
47.8
CQL
2025.04
45.3
IQL
2025.04
44.2
TD3+BC
2025.04
43.4
MOReL
2025.04
40.2
Feedback
Search any
task
Search any
task