SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks
About
Deep reinforcement learning (DRL) has achieved remarkable success in various research domains. However, its reliance on neural networks results in a lack of transparency, which limits its practical applications. To achieve explainability, decision trees have emerged as a popular and promising alternative to neural networks. Nonetheless, due to their limited expressiveness, traditional decision trees struggle with high-dimensional long-horizon continuous control tasks. In this paper, we proposes SkillTree, a novel framework that reduces complex continuous action spaces into discrete skill spaces. Our hierarchical approach integrates a differentiable decision tree within the high-level policy to generate skill embeddings, which subsequently guide the low-level policy in executing skills. By making skill decisions explainable, we achieve skill-level explainability, enhancing the understanding of the decision-making process in complex tasks. Experimental results demonstrate that our method achieves performance comparable to skill-based neural networks in complex robotic arm control domains. Furthermore, SkillTree offers explanations at the skill level, thereby increasing the transparency of the decision-making process.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robotic Manipulation | CALVIN | -- | 13 | |
| Robotic Subtask Completion | Kitchen MKBL | ACS Score3.25 | 7 | |
| Robotic Subtask Completion | OFFICE | ACS2 | 7 | |
| High-Shelf Placement | Elephant Robotics 280 Real-world (test) | Success Rate69.8 | 7 | |
| Robotic Subtask Completion | Kitchen MLSH | ACS2.62 | 7 | |
| Stacking | Elephant Robotics 280 Real-world (test) | Success Rate71.5 | 7 | |
| Target Grasping | Elephant Robotics 280 Real-world (test) | Success Rate68.9 | 7 | |
| Block-stacking | RoboSuite online finetuning | Mean Success Rate71.8 | 7 | |
| Door Opening | RoboSuite online finetuning | Mean Success Rate54 | 7 | |
| Nut Assembly | RoboSuite online finetuning | Mean Success Rate40.3 | 7 |