
SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks

About

Deep reinforcement learning (DRL) has achieved remarkable success in various research domains. However, its reliance on neural networks results in a lack of transparency, which limits its practical applications. To achieve explainability, decision trees have emerged as a popular and promising alternative to neural networks. Nonetheless, due to their limited expressiveness, traditional decision trees struggle with high-dimensional, long-horizon continuous control tasks. In this paper, we propose SkillTree, a novel framework that reduces complex continuous action spaces into discrete skill spaces. Our hierarchical approach integrates a differentiable decision tree within the high-level policy to generate skill embeddings, which subsequently guide the low-level policy in executing skills. By making skill decisions explainable, we achieve skill-level explainability, enhancing the understanding of the decision-making process in complex tasks. Experimental results demonstrate that our method achieves performance comparable to skill-based neural networks in complex robotic arm control domains. Furthermore, SkillTree offers explanations at the skill level, thereby increasing the transparency of the decision-making process.
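To make the high-level idea concrete, here is a minimal sketch of how a differentiable (soft) decision tree could map a state to a distribution over discrete skills and then look up a skill embedding for the low-level policy. It is an illustration under stated assumptions, not the authors' implementation: the sigmoid gating, the heap-ordered tree layout, the per-leaf skill logits, the `codebook` of embeddings, and all function and parameter names are hypothetical.

```python
import math


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def softmax(logits):
    # Subtract the maximum for numerical stability.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def skill_distribution(state, weights, biases, leaf_logits):
    """Soft decision tree over a state (all names are assumptions).

    Inner nodes are stored in breadth-first order: node i has children
    2i+1 (left) and 2i+2 (right). Each inner node routes right with
    probability sigmoid(w_i . state + b_i). Each leaf holds logits over
    the discrete skills.
    """
    n_inner = len(weights)
    assert len(leaf_logits) == n_inner + 1, "a binary tree has n_inner + 1 leaves"
    reach = [0.0] * (2 * n_inner + 1)  # probability of reaching each node
    reach[0] = 1.0
    for i in range(n_inner):
        gate = sigmoid(sum(w * s for w, s in zip(weights[i], state)) + biases[i])
        reach[2 * i + 1] = reach[i] * (1.0 - gate)
        reach[2 * i + 2] = reach[i] * gate
    # Mix the leaf distributions, weighted by the probability of reaching each leaf.
    probs = [0.0] * len(leaf_logits[0])
    for leaf, logits in enumerate(leaf_logits):
        p_leaf = reach[n_inner + leaf]
        for k, p in enumerate(softmax(logits)):
            probs[k] += p_leaf * p
    return probs


def select_skill(state, weights, biases, leaf_logits, codebook):
    # Pick the most likely skill and return its (hypothetical) embedding,
    # which a low-level policy would then condition on.
    probs = skill_distribution(state, weights, biases, leaf_logits)
    skill_id = max(range(len(probs)), key=probs.__getitem__)
    return skill_id, codebook[skill_id]
```

Because every routing decision is a single linear test on the state, the path to the chosen leaf can be read off as a short chain of conditions. That is the sense in which a skill choice made this way can be inspected, whereas the low-level execution of the skill remains a learned policy.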

Yongyan Wen, Siyuan Li, Rongchang Zuo, Lei Yuan, Hangyu Mao, Peng Liu • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
| Robotic Manipulation | CALVIN | -- | 13 |
| Robotic Subtask Completion | Kitchen MKBL | ACS Score 3.25 | 7 |
| Robotic Subtask Completion | OFFICE | ACS 2 | 7 |
| High-Shelf Placement | Elephant Robotics 280 Real-world (test) | Success Rate 69.8 | 7 |
| Robotic Subtask Completion | Kitchen MLSH | ACS 2.62 | 7 |
| Stacking | Elephant Robotics 280 Real-world (test) | Success Rate 71.5 | 7 |
| Target Grasping | Elephant Robotics 280 Real-world (test) | Success Rate 68.9 | 7 |
| Block-stacking | RoboSuite online finetuning | Mean Success Rate 71.8 | 7 |
| Door Opening | RoboSuite online finetuning | Mean Success Rate 54 | 7 |
| Nut Assembly | RoboSuite online finetuning | Mean Success Rate 40.3 | 7 |
Showing 10 of 11 rows
