Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline Reinforcement Learning on D4RL MuJoCo walker2d medium-replay v2
Loading...
87.5
Normalized Score
COUNT
73.044
76.797
80.55
84.303
Mar 9, 2026
Normalized Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Normalized Score
COUNT
training steps=1M, see...
2026.03
87.5
ROMI
training steps=1M, see...
2026.03
85.2
CQL
training steps=1M, see...
2026.03
81.8
RAMBO
training steps=1M, see...
2026.03
81.2
MOBILE
training steps=1M, see...
2026.03
75
IQL
training steps=1M, see...
2026.03
74.8
MOPO
training steps=1M, see...
2026.03
73.6
Feedback
Search any
task
Search any
task