Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Offline-to-Online Reinforcement Learning on HalfCheetah D4RL Suite
Loading...
49.37
Return (HalfCheetah Random)
ROAD
41.1644
43.2947
45.425
47.5553
May 14, 2026
Return (HalfCheetah Random)
Return (HalfCheetah Medium-Replay)
Return (HalfCheetah Medium)
Return (HalfCheetah Medium-Expert)
Return (HalfCheetah Expert)
Updated 19d ago
Evaluation Results
Method
Method
Links
Return (HalfCheetah Random)
Return (HalfCheetah Medium-Replay)
Return (HalfCheetah Medium)
Return (HalfCheetah Medium-Expert)
Return (HalfCheetah Expert)
ROAD
Mixing Ratio Strategy=...
2026.05
49.37
55.82
74.57
95.06
96.86
BR
Mixing Ratio Strategy=...
2026.05
47.43
49.28
72.49
93.34
94.91
0.0
Mixing Ratio Strategy=...
2026.05
41.48
50.7
69.61
63.75
78.28
Feedback
Search any
task
Search any
task