Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LISA: Learning Interpretable Skill Abstractions from Language

About

Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in sequential decision-making. While it is possible to condition on the entire language instruction directly, such an approach could suffer from generalization issues. In our work, we propose \emph{Learning Interpretable Skill Abstractions (LISA)}, a hierarchical imitation learning framework that can learn diverse, interpretable primitive behaviors or skills from language-conditioned demonstrations to better generalize to unseen instructions. LISA uses vector quantization to learn discrete skill codes that are highly correlated with language instructions and the behavior of the learned policy. In navigation and robotic manipulation environments, LISA outperforms a strong non-hierarchical Decision Transformer baseline in the low data regime and is able to compose learned skills to solve tasks containing unseen long-range instructions. Our method demonstrates a more natural way to condition on language in sequential decision-making problems and achieve interpretable and controllable behavior with the learned skills.

Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon• 2022

Related benchmarks

TaskDatasetResultRank
Imitation LearningBabyAI GoToSeq (test)
Success Rate77.2
9
Imitation LearningBabyAI SynthSeq (test)
Success Rate0.612
9
Imitation LearningBabyAI BossLevel (test)
Success Rate69.8
9
Robotic ManipulationLOReL Sawyer Dataset (test)
Close Drawer Success Rate1
6
Robot ManipulationLOReL Sawyer (unseen multi-step composition)
Success Rate20.89
4
Imitation LearningLOREL States (fully obs.) (test)
Success Rate66.7
3
Imitation LearningLOREL Images (partial obs.) (test)
Success Rate40
3
Language-conditioned robot manipulationLOReL Sawyer 15 unseen instructions (test)
Success Rate20.89
3
Language-conditioned robot task executionLOReL Sawyer seen rephrasals
Success Rate40
3
Language-conditioned robot task executionLOReL Sawyer unseen noun rephrasals
Success Rate33.33
3
Showing 10 of 14 rows

Other info

Follow for update