Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

About

Most existing deep reinforcement learning (DRL) frameworks consider either discrete action space or continuous action space solely. Motivated by applications in computer games, we consider the scenario with discrete-continuous hybrid action space. To handle hybrid action space, previous works either approximate the hybrid space by discretization, or relax it into a continuous set. In this paper, we propose a parametrized deep Q-network (P- DQN) framework for the hybrid action space without approximation or relaxation. Our algorithm combines the spirits of both DQN (dealing with discrete action space) and DDPG (dealing with continuous action space) by seamlessly integrating them. Empirical results on a simulation example, scoring a goal in simulated RoboCup soccer and the solo mode in game King of Glory (KOG) validate the efficiency and effectiveness of our method.

Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, Yang Zheng, Haobo Fu, Tong Zhang, Ji Liu, Han Liu• 2018

Related benchmarks

TaskDatasetResultRank
Half Field OffenseHalf Field Offense (HFO) (evaluation)
P(Goal)0.989
7
Air-to-Air CombatAir-to-Air Combat Easy
Success Rate81.2
7
Air-to-Air CombatAir-to-Air Combat Medium
Success Rate74.2
7
Air-to-Air CombatAir-to-Air Combat Hard
Success Rate57.2
7
Maze NavigationMaze Navigation Easy
Success Rate85.8
7
Maze NavigationMaze Navigation Medium
Success Rate68.8
7
Maze NavigationMaze Navigation Hard
Success Rate38.4
7
Platform ControlPlatform (Evaluation)
Return96.4
5
Robot SoccerRobot Soccer Goal (Evaluation)
P(Goal)70.1
5
Showing 9 of 9 rows

Other info

Follow for update