Safe Reinforcement Learning on Swimmer-vel (offline)

Normalized Reward: 2.39 (Task-Only)

Updated 12d ago

Evaluation Results

Method	Links	Normalized Reward
Task-Only 2026.05		2.39	1
SOPL 2026.05		1.34	0.014
Oracle 2026.05		1	0
Safe-CPL 2026.05		0.99	0.005
Safe-VPL 2026.05		0.88	0.001
RC 2026.05		0.83	0