Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safe Reinforcement Learning on Safety Gymnasium HalfCheetah (threshold=5)
Loading...
2,619
Return
PPO-Lag
2,062.6
2,207.05
2,351.5
2,495.95
Mar 24, 2026
Return
Cost
Updated 24d ago
Evaluation Results
Method
Method
Links
Return
Cost
PPO-Lag
Type=Oracle
2026.03
2,619
4.82
PPO-BT
2026.03
2,494
5.58
PbCRL
2026.03
2,367
4.66
RLSF
2026.03
2,084
3.26
Feedback
Search any
task
Search any
task