Graph Constrained Reinforcement Learning for Natural Language Action Spaces
About
Interactive Fiction games are text-based simulations in which an agent interacts with the world purely through natural language. They are ideal environments for studying how to extend reinforcement learning agents to meet the challenges of natural language understanding, partial observability, and action generation in combinatorially large text-based action spaces. We present KG-A2C, an agent that builds a dynamic knowledge graph while exploring and generates actions using a template-based action space. We contend that the dual uses of the knowledge graph, to reason about game state and to constrain natural language generation, are the keys to scalable exploration of combinatorially large natural language action spaces. Results across a wide variety of IF games show that KG-A2C outperforms current IF agents despite the exponential increase in action space size.
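The core idea of constraining a template-based action space with a knowledge graph can be sketched as follows. This is a minimal, hypothetical illustration (the template strings, the `OBJ` slot marker, and all function names are assumptions for exposition, not the paper's actual code): templates are grounded only with entities the agent's knowledge graph currently contains, rather than with the game's full vocabulary.

```python
from itertools import product

# Illustrative templates with OBJ slots; real games expose a few hundred of these.
templates = ["take OBJ", "open OBJ", "put OBJ in OBJ", "go north"]

# Knowledge graph as (subject, relation, object) triples built while exploring.
kg = {
    ("kitchen", "has", "lamp"),
    ("kitchen", "has", "chest"),
    ("you", "in", "kitchen"),
}

def kg_entities(kg):
    """Entities the agent currently knows about (the graph's nodes)."""
    return {s for s, _, _ in kg} | {o for _, _, o in kg}

def valid_actions(templates, kg):
    """Ground each template's OBJ slots using only known entities."""
    entities = sorted(kg_entities(kg))
    actions = []
    for t in templates:
        n_slots = t.count("OBJ")
        if n_slots == 0:
            actions.append(t)
            continue
        # Fill slots with every combination of known entities.
        for combo in product(entities, repeat=n_slots):
            a = t
            for e in combo:
                a = a.replace("OBJ", e, 1)
            actions.append(a)
    return actions

acts = valid_actions(templates, kg)
print(len(acts))  # far smaller than grounding over the full game vocabulary
```

With four known entities this yields 25 candidate actions; grounding the same templates over a full game vocabulary of hundreds of words would yield tens of thousands. KG-A2C exploits exactly this gap, scoring grounded actions with a learned policy rather than enumerating the unconstrained space.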
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Knowledge Graph Prediction | JerichoWorld 1.0 (val test) | Zork1 Score | 8.42 | 20 |
| Clean up messy room (Text-based Game) | TWC Medium (in-distribution) | Steps | 41.61 | 8 |
| Clean up messy room (Text-based Game) | TWC In-distribution Hard | Steps | 48 | 8 |
| Clean up messy room (Text-based Game) | TWC In-distribution Easy | Steps | 22.1 | 8 |
| Interactive Science Simulation | ScienceWorld v1.0 (test) | Task 1-1 (L) Score | 0.00e+0 | 8 |
| Science simulation and text-based scientific reasoning | ScienceWorld variations (test) | Changes of State: Boiling Success | 0.00e+0 | 7 |
| Text-based Reinforcement Learning | Jericho benchmark (test) | Zork1 (Eps) | 34 | 6 |