Share your thoughts, 1 month free Claude Pro on usSee more

Constrained Reinforcement Learning on Navigation

217.6Episodic Reward

e-COP

Updated 4mo ago

Evaluation Results

Method	Links
e-COP 2025.12		217.6	9.6	23.7
PPO-L 2025.12		175.1	9.9	22.3
IPO 2025.12		164.1	10	24.6
P3O 2025.12		153.5	9.9	24.5
APPO 2025.12		135.7	9.9	23.9