Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph

About

We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the underlying relations of all goals in the goal space through a Dirichlet-categorical process that facilitates: 1) the high-level network raising a sub-goal towards achieving a designated final goal; 2) the low-level network towards an optimal policy; and 3) the overall system generalizing unseen environments and goals. We evaluate our approach with two settings of partially observable goal-driven tasks -- a grid-world domain and a robotic object search task. Our experimental results show that our approach exhibits superior generalization performance on both unseen environments and new goals.

Xin Ye, Yezhou Yang• 2021

Related benchmarks

TaskDatasetResultRank
Robotic object searchAI2-THOR Seen Env., Seen Goals 1.0
Success Rate74
5
Robotic object searchAI2-THOR 1.0 (Seen Env., Unseen Goals)
Success Rate (SR)73
5
Robotic object searchAI2-THOR Unseen Env., Seen Goals 1.0
Success Rate (SR)71
5
Robotic object searchAI2-THOR Unseen Env., Unseen Goals 1.0
Success Rate (SR)75
5
Robotic object searchHouse3D Single Environment (Seen Goals)
SR88
5
Robotic object searchHouse3D Single Environment (Unseen Goals)
SR0.79
5
Robotic object searchHouse3D Multiple Environments (Seen Env.)
Success Rate76
5
Robotic object searchHouse3D Multiple Environments (Unseen Env.)
SR62
5
Goal-driven navigationGrid-world Seen Goals (unseen maps)
SR57
5
Goal-driven navigationGrid-world Unseen Goals (unseen maps)
Success Rate70
5
Showing 10 of 11 rows

Other info

Code

Follow for update