| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Vehicle Avoidance Moving Obstacles | VSRL | Verified Success Rate (50th Percentile)100 | 14 | 3mo ago | |
| AntCircle (test) | CPPOADRC | Violation Rate0 | 12 | 3mo ago | |
| AntButton (test) | TD3ADRC | Violation Rate0 | 12 | 3mo ago | |
| RacecarPush | TRPO | Violation Rate26.06 | 12 | 3mo ago | |
| RacecarGoal | TRPO | Violation Rate34.03 | 12 | 3mo ago | |
| RacecarCircle (test) | TRPOADRC | Violation Rate0.2364 | 12 | 3mo ago | |
| RacecarButton (test) | Violation Rate72.49 | 12 | 3mo ago | ||
| CarPush | TRPOADRC | Violation Rate34.75 | 12 | 3mo ago | |
| CarGoal | TRPOADRC | Violation Rate29.12 | 12 | 3mo ago | |
| CarCircle (test) | TRPOADRC | Violation Rate17.71 | 12 | 3mo ago | |
| CarButton (test) | CPPOADRC | Violation Rate50.16 | 12 | 3mo ago | |
| MetaDrive | RCDT | Normalized Reward0.69 | 10 | 3mo ago | |
| Bullet Safety Gym | BCQ-Lag | Normalized Reward0.73 | 10 | 3mo ago | |
| Safety Gym POINTGOAL1 (original) | SEditor | Utility Score29 | 10 | 3mo ago | |
| T2D cohort without Pump (Aggregated across cohorts) | CRPO | TIR (%)86.71 | 8 | 3mo ago | |
| Hopper-Velocity | RCPO | Reward1,554.56 | 7 | 3mo ago | |
| 3D Quadrotor Fixed Obstacles | VSRL | Verified-15 Count100 | 7 | 3mo ago | |
| 2D Quadrotor Fixed Obstacles | VSRL | Verified Count (50)100 | 7 | 3mo ago | |
| Lane Following | VSRL | Verified Rate (80)100 | 7 | 3mo ago | |
| OPF with Battery Energy Storage | CUP | Training Time (s)239.4 | 7 | 3mo ago | |
| Spring Pendulum | CUP | Training Time (s)81.4 | 7 | 3mo ago | |
| Safe CartPole | CPO | Training Time (s)68.7 | 7 | 3mo ago | |
| HalfCheetah vel (offline) | Task-Only | Normalized Reward1.85 | 6 | 12d ago | |
| Swimmer-vel (offline) | Task-Only | Normalized Reward2.39 | 6 | 12d ago | |
| Ant-vel (offline) | Task-Only | Normalized Reward1.23 | 6 | 12d ago |