Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

How to Solve Contextual Goal-Oriented Problems with Offline Datasets?

About

We present a novel method, Contextual goal-Oriented Data Augmentation (CODA), which uses commonly available unlabeled trajectories and context-goal pairs to solve Contextual Goal-Oriented (CGO) problems. By carefully constructing an action-augmented MDP that is equivalent to the original MDP, CODA creates a fully labeled transition dataset under training contexts without additional approximation error. We conduct a novel theoretical analysis to demonstrate CODA's capability to solve CGO problems in the offline data setup. Empirical results also showcase the effectiveness of CODA, which outperforms other baseline methods across various context-goal relationships of CGO problem. This approach offers a promising direction to solving CGO problems using offline datasets.

Ying Fan, Jingling Li, Adith Swaminathan, Aditya Modi, Ching-An Cheng• 2024

Related benchmarks

TaskDatasetResultRank
Goal ReachingAntMaze large-play v2
Success Rate60
10
Goal ReachingAntMaze medium-play v2
Success Rate76.8
10
Goal ReachingAntMaze umaze v2
Success Rate94.8
6
Goal ReachingAntMaze Medium-Diverse v2
Success Rate84.5
6
Goal ReachingAntMaze large-diverse v2
Success Rate36.8
6
Goal ReachingAntMaze umaze-diverse v2
Success Rate72.8
6
Goal-reaching NavigationFour Rooms medium-play v1 (test)
Average Success Rate0.787
4
Goal-reaching NavigationFour Rooms large-diverse v1 (test)
Success Rate72.2
4
Offline Context-conditioned Goal-oriented (CGO) Reinforcement LearningRandom Cells (medium-diverse)
Success Rate72.5
4
Offline Context-conditioned Goal-oriented (CGO) Reinforcement LearningRandom Cells (large-play)
Success Rate60.2
4
Showing 10 of 16 rows

Other info

Code

Follow for update