Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Constrained Reinforcement Learning on PointCircle
Loading...
110.5
Episodic Reward
e-COP
55.068
69.459
83.85
98.241
Dec 11, 2025
Episodic Reward
Episodic Cost
Updated 4d ago
Evaluation Results
Method
Method
Links
Episodic Reward
Episodic Cost
e-COP
Cost Threshold=10, Num...
2025.12
110.5
9.8
APPO
Cost Threshold=10, Num...
2025.12
91.2
10.2
P3O
Cost Threshold=10, Num...
2025.12
89.1
9.9
FOCOPS
Cost Threshold=10, Num...
2025.12
81.6
10
IPO
Cost Threshold=10, Num...
2025.12
68.7
9.3
PCPO
Cost Threshold=10, Num...
2025.12
68.2
9.9
CPO
Cost Threshold=10, Num...
2025.12
65.3
9.5
PPO-L
Cost Threshold=10, Num...
2025.12
57.2
9.8
Feedback
Search any
task
Search any
task