Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning

About

Compositional generalization is essential for reaching unseen goals under novel contextual variations in offline goal-conditioned reinforcement learning (GCRL), where a generalist goal-reaching agent must be learned from limited data. Most prior approaches pursue this via trajectory stitching over temporally contiguous segments, which limits composing behaviors across varying contexts. To overcome this limitation, we formalize analogy transduction as synthesizing new plans by composing task-endogenous analogies with given contexts and propose a novel analogy representation tailored for it. Grounded in our theory, this analogy representation captures what changes under optimal task execution, remains invariant to contextual variations, and is sufficient for optimal goal reaching. We further contend that generalization to unseen analogy-context pairs is a practical obstacle in analogy transduction, and introduce a new approach for offline GCRL that enables analogy transduction beyond seen pairs to unseen combinations. We empirically demonstrate the effectiveness of our approach on OGBench manipulation environments, substantially outperforming prior methods that do not perform analogy transduction. Project page: https://rllab-snu.github.io/projects/CTA/

Junseok Kim, Dohyeong Kim, Mineui Hong, Songhwai Oh• 2026

Related benchmarks

TaskDatasetResultRank
Goal-conditioned manipulationOGBench puzzle-4x4-play
Score0.84
24
ManipulationOGBench cube-triple-play
Success Rate17
19
Offline Goal-Conditioned Reinforcement Learninghumanoidmaze large-navigate v0
Success Rate60
14
Goal-conditioned manipulationOGBench scene-play
Success Rate90
12
Goal-conditioned manipulationOGBench puzzle-4x5-play
Success Rate17
12
Goal-conditioned manipulationOGBench puzzle 3x3-play
Success Rate94
12
Goal-conditioned manipulationOGBench puzzle-4x6-play
Success Rate12
12
Goal-conditioned manipulationOGBench cube single-play
Success Rate86
12
Offline Goal-Conditioned Reinforcement LearningOGBench humanoidmaze-medium-navigate
Success Rate90
7
Offline Goal-Conditioned Reinforcement LearningOGBench antmaze-giant-navigate
Success Rate54
7
Showing 10 of 13 rows

Other info

Follow for update