Dual Goal Representations
About
In this work, we introduce dual goal representations for goal-conditioned reinforcement learning (GCRL). A dual goal representation characterizes a state by "the set of temporal distances from all other states"; in other words, it encodes a state through its relations to every other state, measured by temporal distance. This representation provides several appealing theoretical properties. First, it depends only on the intrinsic dynamics of the environment and is invariant to the original state representation. Second, it contains provably sufficient information to recover an optimal goal-reaching policy, while being able to filter out exogenous noise. Based on this concept, we develop a practical goal representation learning method that can be combined with any existing GCRL algorithm. Through diverse experiments on the OGBench task suite, we empirically show that dual goal representations consistently improve offline goal-reaching performance across 20 state- and pixel-based tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Offline Reinforcement Learning | D4RL Franka Kitchen | Mixed Success Rate77 | 43 | |
| Goal-conditioned manipulation | OGBench puzzle-4x4-play | Score0.34 | 24 | |
| Robotic Manipulation | D4RL Kitchen-Partial | Normalized Score100 | 23 | |
| Manipulation | OGBench cube-triple-play | Success Rate18 | 19 | |
| Robotic Manipulation | D4RL Kitchen-Mixed | -- | 14 | |
| Goal-conditioned manipulation | OGBench cube single-play | Success Rate89 | 12 | |
| Goal-conditioned manipulation | OGBench scene-play | Success Rate87 | 12 | |
| Goal-conditioned manipulation | OGBench puzzle 3x3-play | Success Rate79 | 12 | |
| Goal-conditioned manipulation | OGBench puzzle-4x5-play | Success Rate10 | 12 | |
| Goal-conditioned manipulation | OGBench puzzle-4x6-play | Success Rate6 | 12 |