Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
About
We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the underlying relations of all goals in the goal space through a Dirichlet-categorical process that facilitates: 1) the high-level network raising a sub-goal towards achieving a designated final goal; 2) the low-level network towards an optimal policy; and 3) the overall system generalizing unseen environments and goals. We evaluate our approach with two settings of partially observable goal-driven tasks -- a grid-world domain and a robotic object search task. Our experimental results show that our approach exhibits superior generalization performance on both unseen environments and new goals.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robotic object search | AI2-THOR Seen Env., Seen Goals 1.0 | Success Rate74 | 5 | |
| Robotic object search | AI2-THOR 1.0 (Seen Env., Unseen Goals) | Success Rate (SR)73 | 5 | |
| Robotic object search | AI2-THOR Unseen Env., Seen Goals 1.0 | Success Rate (SR)71 | 5 | |
| Robotic object search | AI2-THOR Unseen Env., Unseen Goals 1.0 | Success Rate (SR)75 | 5 | |
| Robotic object search | House3D Single Environment (Seen Goals) | SR88 | 5 | |
| Robotic object search | House3D Single Environment (Unseen Goals) | SR0.79 | 5 | |
| Robotic object search | House3D Multiple Environments (Seen Env.) | Success Rate76 | 5 | |
| Robotic object search | House3D Multiple Environments (Unseen Env.) | SR62 | 5 | |
| Goal-driven navigation | Grid-world Seen Goals (unseen maps) | SR57 | 5 | |
| Goal-driven navigation | Grid-world Unseen Goals (unseen maps) | Success Rate70 | 5 |