Share your thoughts, 1 month free Claude Pro on usSee more

Constrained Reinforcement Learning on Humanoid

1,734.1Episodic Reward

FOCOPS

Updated 4mo ago

Evaluation Results

Method	Links
FOCOPS 2025.12		1,734.1	19.7
P3O 2025.12		1,669.4	20.1
e-COP 2025.12		1,652.5	17.3
PCPO 2025.12		1,602.3	16.3
IPO 2025.12		1,578.6	19.1
APPO 2025.12		1,488.2	20
CPO 2025.12		1,465.1	18.5
PPO-L 2025.12		1,431.2	18.8