Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline Reinforcement Learning on HalfCheetah Medium Delayed Reward (D4RL)
Loading...
42.9
Score
ISCT
-0.676
10.637
21.95
33.263
Feb 12, 2026
Score
Updated 3mo ago
Evaluation Results
Method
Method
Links
Score
ISCT
Reward Type=Delayed
2026.02
42.9
QDT
Reward Type=Delayed
2026.02
42.4
DT
Reward Type=Delayed
2026.02
42.2
CQL
Reward Type=Delayed
2026.02
1
Feedback
Search any
task
Search any
task