Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safe Reinforcement Learning on SafetyPointGoal2 v0

15.58Mean Reward

TRPO

-1.7882.7217.2311.739Apr 3, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.04
15.5810.31164.1488.43
2026.04
13.2614.05167.4687.06
2026.04
2.378.4689.04187.67
2026.04
2.245.154.164.5
2026.04
-1.121.1124.3237.84