Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph

About

We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the underlying relations of all goals in the goal space through a Dirichlet-categorical process that facilitates: 1) the high-level network raising a sub-goal towards achieving a designated final goal; 2) the low-level network towards an optimal policy; and 3) the overall system generalizing unseen environments and goals. We evaluate our approach with two settings of partially observable goal-driven tasks -- a grid-world domain and a robotic object search task. Our experimental results show that our approach exhibits superior generalization performance on both unseen environments and new goals.

Xin Ye, Yezhou Yang• 2021

Related benchmarks

Task	Dataset	Result
Robotic object search	AI2-THOR Seen Env., Seen Goals 1.0	Success Rate74	5
Robotic object search	AI2-THOR 1.0 (Seen Env., Unseen Goals)	Success Rate (SR)73	5
Robotic object search	AI2-THOR Unseen Env., Seen Goals 1.0	Success Rate (SR)71	5
Robotic object search	AI2-THOR Unseen Env., Unseen Goals 1.0	Success Rate (SR)75	5
Robotic object search	House3D Single Environment (Seen Goals)	SR88	5
Robotic object search	House3D Single Environment (Unseen Goals)	SR0.79	5
Robotic object search	House3D Multiple Environments (Seen Env.)	Success Rate76	5
Robotic object search	House3D Multiple Environments (Unseen Env.)	SR62	5
Goal-driven navigation	Grid-world Seen Goals (unseen maps)	SR57	5
Goal-driven navigation	Grid-world Unseen Goals (unseen maps)	Success Rate70	5

Showing 10 of 11 rows

Other info

Code

Follow for update

@wizwand_team Discord