Safe Reinforcement Learning on Reach (offline)

1.04 Normalized Reward

Task-Only

Updated 12d ago

Evaluation Results

Method	Links	Normalized Reward	(unlabeled)
Task-Only 2026.05		1.04	1
Oracle 2026.05		1	0.038
SOPL 2026.05		0.98	0.024
Safe-VPL 2026.05		0.98	0.166
Safe-CPL 2026.05		0.98	0.069
RC 2026.05		0.83	0.101