Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

About

Reinforcement learning (RL) in robotics faces significant hurdles regarding sample efficiency and generalization across varying goals. While Offline RL mitigates the need for costly online interactions, its integration with goal-conditioned policies and transformer-based architectures remains underexplored. We introduce a Goal-Conditioned Decision Transformer adapted for offline multi-goal robotics. By explicitly incorporating goal states into the sequence modeling framework, our approach efficiently solves varying tasks using only pre-collected data. We validate this method on a newly released offline dataset for the Franka Emika Panda platform. Experimental results demonstrate that our approach outperforms state-of-the-art online baselines in complex tasks and maintains robustness in sparse-reward settings, even with limited expert demonstrations.

Pawe{\l} Gajewski, Dominik \.Zurek, Marcin Pietro\'n, Kamil Faber• 2024

Related benchmarks

TaskDatasetResultRank
PickAndPlacePickAndPlace Dense Reward
Return-1.3
3
PickAndPlacePickAndPlace Sparse Reward
Return-7.63
3
PushPush Dense Reward
Return-0.95
3
ReachReach Dense Reward
Return-0.21
3
ReachReach Sparse Reward
Return-1.72
3
PushPush Sparse Reward
Return-8.26
3
Showing 6 of 6 rows

Other info

Follow for update