Offline Reinforcement Learning on HalfCheetah Medium-Replay Delayed Reward D4RL
[Chart: Score over time. Best score: 42.4 (ISCT), Feb 12, 2026.]
Updated 3mo ago
Evaluation Results
Method  Reward Type  Date     Score
ISCT    Delayed      2026.02  42.4
DT      Delayed      2026.02  33
QDT     Delayed      2026.02  32.8
CQL     Delayed      2026.02  7.8