Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Density Constrained Reinforcement Learning

About

We study constrained reinforcement learning (CRL) from a novel perspective by setting constraints directly on state density functions, rather than the value functions considered by previous works. State density has a clear physical and mathematical interpretation, and is able to express a wide variety of constraints such as resource limits and safety requirements. Density constraints can also avoid the time-consuming process of designing and tuning cost functions required by value function-based constraints to encode system specifications. We leverage the duality between density functions and Q functions to develop an effective algorithm to solve the density constrained RL problem optimally and the constrains are guaranteed to be satisfied. We prove that the proposed algorithm converges to a near-optimal solution with a bounded error even when the policy update is imperfect. We use a set of comprehensive experiments to demonstrate the advantages of our approach over state-of-the-art CRL methods, with a wide range of density constrained tasks as well as standard CRL benchmarks such as Safety-Gym.

Zengyi Qin, Yuxiao Chen, Chuchu Fan• 2021

Related benchmarks

TaskDatasetResultRank
PointGoal2Safety Gymnasium
Normalized Reward28
21
PointButton2Safety Gymnasium
Normalized Reward18
21
PointPush2Safety Gymnasium
Normalized Reward2
21
PointButton1Safety Gymnasium
Normalized Reward1
21
PointGoal1Safety Gymnasium
Normalized Reward0.24
21
PointPush1Safety Gymnasium
Normalized Reward1
21
CarButton2Safety Gymnasium
Reward0.09
19
CarButton1Safety Gymnasium
Reward0.12
19
CarGoal1Safety Gymnasium
Reward35
19
CarGoal2Safety Gymnasium
Reward0.11
19
Showing 10 of 13 rows

Other info

Follow for update