Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safe Reinforcement Learning on SafetyPointPush2 v0

0.6Mean Reward

PPO-LAG

-0.8248-0.4549-0.0850.2849Apr 3, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.04
0.61.5831.3458.17
2026.04
0.413.1159.86120.18
2026.04
0.22.5106.88216.19
2026.04
-0.070.527.737.51
2026.04
-0.775.5128.2267.21