Density Constrained Reinforcement Learning

About

We study constrained reinforcement learning (CRL) from a novel perspective by setting constraints directly on state density functions, rather than the value functions considered by previous works. State density has a clear physical and mathematical interpretation, and is able to express a wide variety of constraints such as resource limits and safety requirements. Density constraints can also avoid the time-consuming process of designing and tuning cost functions required by value function-based constraints to encode system specifications. We leverage the duality between density functions and Q functions to develop an effective algorithm to solve the density constrained RL problem optimally and the constrains are guaranteed to be satisfied. We prove that the proposed algorithm converges to a near-optimal solution with a bounded error even when the policy update is imperfect. We use a set of comprehensive experiments to demonstrate the advantages of our approach over state-of-the-art CRL methods, with a wide range of density constrained tasks as well as standard CRL benchmarks such as Safety-Gym.

Zengyi Qin, Yuxiao Chen, Chuchu Fan• 2021

Related benchmarks

Task	Dataset	Result
PointGoal2	Safety Gymnasium	Normalized Reward28	21
PointButton2	Safety Gymnasium	Normalized Reward18	21
PointPush2	Safety Gymnasium	Normalized Reward2	21
PointButton1	Safety Gymnasium	Normalized Reward1	21
PointGoal1	Safety Gymnasium	Normalized Reward0.24	21
PointPush1	Safety Gymnasium	Normalized Reward1	21
CarButton2	Safety Gymnasium	Reward0.09	19
CarButton1	Safety Gymnasium	Reward0.12	19
CarGoal1	Safety Gymnasium	Reward35	19
CarGoal2	Safety Gymnasium	Reward0.11	19

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord