| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety-constrained Reinforcement Learning | Safety-Gym SafetyPointCircle1 (evaluation) | Average Reward25.15 | 11 | |
| Safety-constrained Reinforcement Learning | Safety-Gym SafetyPointGoal1 (evaluation) | Average Reward11.37 | 11 | |
| Safe Reinforcement Learning | Safety Gym POINTGOAL1 (original) | Utility Score29 | 10 | |
| Safe Locomotion | Safety-Gym Hazard | Safe State Ratio40 | 7 | |
| Continuous Control | Safety-Gym Navigation | Final Reward38.5 | 7 | |
| Walker2dVelocity | Safety Gym | Normalized Reward93 | 6 | |
| SwimmerVelocity | Safety Gym | Normalized Reward0.94 | 6 | |
| PointGoal | Safety Gym | Normalized Reward0.64 | 6 | |
| HopperVelocity | Safety Gym | Normalized Reward93 | 6 | |
| HalfCheetahVelocity | Safety Gym | Normalized Reward99 | 6 | |
| CarGoal | Safety Gym | Normalized Reward0.44 | 6 | |
| AntVelocity | Safety Gym | Normalized Reward0.93 | 6 | |
| Reinforcement Learning | Safety Gym AntVel | Reward1 | 6 | |
| Reinforcement Learning | Safety Gym Walker2dVel | Reward0.79 | 6 | |
| Reinforcement Learning | Safety Gym HalfCheetahVel | Reward0.97 | 6 | |
| Reinforcement Learning | Safety Gym HopperVel | Reward0.7 | 6 | |
| Reinforcement Learning | Safety Gym SwimmerVel | Reward0.63 | 6 | |
| Reinforcement Learning | Safety Gym (test) | Point Goal 1 Success19.13 | 5 | |
| Safety Validation | Safety Gym | Safety Gym Cost Rate0.014 | 3 | |
| Safe Reinforcement Learning | Safety Gym | Metric- | 0 |