| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Offline Reinforcement Learning | D4RL Locomotion medium, medium-replay, medium-expert v2 | Score (HalfCheetah, Medium)66.8 | 34 | |
| Offline-to-Online Reinforcement Learning | D4RL Locomotion medium-expert | Average Normalized Return107.9 | 15 | |
| Offline-to-Online Reinforcement Learning | D4RL Locomotion medium | Average Normalized Return98.3 | 15 | |
| Offline-to-Online Reinforcement Learning | D4RL Locomotion medium-replay | Avg Normalized Return90.8 | 15 |