Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline Reinforcement Learning on D4RL HalfCheetah Med-Expert
Loading...
105.9
Normalized Return
MBOP
51.196
65.398
79.6
93.802
Oct 30, 2023
Normalized Return
Updated 1mo ago
Evaluation Results
Method
Method
Links
Normalized Return
MBOP
2023.10
105.9
TT
2023.10
95
CQL
2023.10
91.6
RGG+
planning seeds=15
2023.10
91.2
RGG
planning seeds=15
2023.10
90.8
DT
2023.10
86.8
IQL
2023.10
86.7
Diffuser
2023.10
79.8
MOPO
2023.10
63.3
BC
2023.10
55.2
MOREL
2023.10
53.3
Feedback
Search any
task
Search any
task