Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Constrained Reinforcement Learning on Humanoid
Loading...
1,734.1
Episodic Reward
FOCOPS
1,419.084
1,500.867
1,582.65
1,664.433
Dec 11, 2025
Episodic Reward
Episodic Cost
Updated 1mo ago
Evaluation Results
Method
Method
Links
Episodic Reward
Episodic Cost
FOCOPS
Cost Threshold=20, Num...
2025.12
1,734.1
19.7
P3O
Cost Threshold=20, Num...
2025.12
1,669.4
20.1
e-COP
Cost Threshold=20, Num...
2025.12
1,652.5
17.3
PCPO
Cost Threshold=20, Num...
2025.12
1,602.3
16.3
IPO
Cost Threshold=20, Num...
2025.12
1,578.6
19.1
APPO
Cost Threshold=20, Num...
2025.12
1,488.2
20
CPO
Cost Threshold=20, Num...
2025.12
1,465.1
18.5
PPO-L
Cost Threshold=20, Num...
2025.12
1,431.2
18.8
Feedback
Search any
task
Search any
task