
Safe Reinforcement Learning on Ant-vel (offline)

Normalized Reward: 1.23

Task-Only

Updated 12d ago

Evaluation Results

Method	Links	Normalized Reward	(unlabeled in source)
Task-Only 2026.05		1.23	1
Oracle 2026.05		1	0.002
SOPL 2026.05		0.98	0.007
Safe-VPL 2026.05		0.9	0.001
RC 2026.05		0.77	0.078
Safe-CPL 2026.05		0.76	0.007