| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Safety-Gym SafetyPointCircle1 (evaluation) | PPO | Average Reward25.15 | 11 | 19d ago | |
| Safety-Gym SafetyPointGoal1 (evaluation) | PPO | Average Reward11.37 | 11 | 19d ago | |
| Extended Chain CMDP (last 1,000 episodes) | Unconstrained | Jc2 Constraint Metric0.069 | 3 | 1mo ago | |
| Grid-world Time-Variant Safety Threshold (100 randomly generated environments) | MASE | Safety Violations0 | 2 | 3mo ago | |
| Grid-world Time-Invariant Safety Threshold (100 randomly generated environments) | SNO-MDP | Safety Violation Count0 | 2 | 3mo ago |