| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Offline Control | Heterogeneous Pendulum Rich-Data 600,000 transition steps | Cumulative Reward-0.6 | 7 | |
| Offline Control | Heterogeneous Pendulum 300,000 transition steps (Mid-Data) | Cumulative Reward-1.25 | 7 | |
| Offline Control | Heterogeneous Pendulum Low-Data 100,000 transition steps | Cumulative Reward-1.39 | 7 |