Causal Induction from Visual Observations for Goal Directed Tasks
About
Causal reasoning has been an indispensable capability for humans and other intelligent animals to interact with the physical world. In this work, we propose to endow an artificial agent with the capability of causal reasoning for completing goal-directed tasks. We develop learning-based approaches to inducing causal knowledge in the form of directed acyclic graphs, which can be used to contextualize a learned goal-conditional policy to perform tasks in novel environments with latent causal structures. We leverage attention mechanisms in our causal induction model and goal-conditional policy, enabling us to incrementally generate the causal graph from the agent's visual observations and to selectively use the induced graph for determining actions. Our experiments show that our method effectively generalizes towards completing new tasks in novel environments with previously unseen causal structures.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Box/Door Unlocking | Unlock Spuriousness S (test) | Success Rate32.7 | 10 | |
| Box/Door Unlocking | Unlock Composition C (test) | Success Rate3.15e+3 | 10 | |
| Box/Door Unlocking | Unlock In-distribution I (test) | Success Rate3.17e+3 | 10 | |
| Crash Avoidance | Crash In-distribution I (test) | Success Rate27.9 | 10 | |
| Crash Avoidance | Crash Composition C (test) | Success Rate7.8 | 10 | |
| Object Stacking | Stack In-distribution I (test) | Success Rate71.8 | 10 | |
| Object Stacking | Stack Spuriousness S (test) | Success Rate71 | 10 | |
| Object Stacking | Stack Composition C (test) | Success Rate58.6 | 10 | |
| Crash Avoidance | Crash Spuriousness S (test) | Success Rate15.8 | 10 | |
| Causal Discovery | Chemistry environment Chain (ID) | Success Rate37.5 | 9 |