Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Constrained Reinforcement Learning on Navigation
Loading...
217.6
Episodic Reward
e-COP
132.424
154.537
176.65
198.763
Dec 11, 2025
Episodic Reward
Episodic Cost 1
Episodic Cost 2
Updated 4d ago
Evaluation Results
Method
Method
Links
Episodic Reward
Episodic Cost 1
Episodic Cost 2
e-COP
Cost Threshold C1=10,...
2025.12
217.6
9.6
23.7
PPO-L
Cost Threshold C1=10,...
2025.12
175.1
9.9
22.3
IPO
Cost Threshold C1=10,...
2025.12
164.1
10
24.6
P3O
Cost Threshold C1=10,...
2025.12
153.5
9.9
24.5
APPO
Cost Threshold C1=10,...
2025.12
135.7
9.9
23.9
Feedback
Search any
task
Search any
task