
Safety Reinforcement Learning on SafetyPointGoal1 v0

Reward: 27.45

DDPG-AdaGamma

Updated 2mo ago

Evaluation Results

Method	Links	Reward	(unlabeled)
DDPG-AdaGamma 2026.05		27.45	46.15
DDPG-Fixed-γ 2026.05		27.34	51.45
DDPG-CrossValidate 2026.05		27.26	53.85
DDPG-Uncertainty 2026.05		27.1	52.7
TRPO-AdaGamma 2026.05		25.93	49.4
TRPO-Fixed-γ 2026.05		24.48	45.9
TRPO-CrossValidate 2026.05		24.38	46.35
TRPO-Uncertainty 2026.05		23.5	64.55