Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning

About

Compositional generalization is essential for reaching unseen goals under novel contextual variations in offline goal-conditioned reinforcement learning (GCRL), where a generalist goal-reaching agent must be learned from limited data. Most prior approaches pursue this via trajectory stitching over temporally contiguous segments, which limits composing behaviors across varying contexts. To overcome this limitation, we formalize analogy transduction as synthesizing new plans by composing task-endogenous analogies with given contexts and propose a novel analogy representation tailored for it. Grounded in our theory, this analogy representation captures what changes under optimal task execution, remains invariant to contextual variations, and is sufficient for optimal goal reaching. We further contend that generalization to unseen analogy-context pairs is a practical obstacle in analogy transduction, and introduce a new approach for offline GCRL that enables analogy transduction beyond seen pairs to unseen combinations. We empirically demonstrate the effectiveness of our approach on OGBench manipulation environments, substantially outperforming prior methods that do not perform analogy transduction. Project page: https://rllab-snu.github.io/projects/CTA/

Junseok Kim, Dohyeong Kim, Mineui Hong, Songhwai Oh• 2026

Related benchmarks

Task	Dataset	Result
Goal-conditioned manipulation	OGBench puzzle-4x4-play	Score0.84	24
Manipulation	OGBench cube-triple-play	Success Rate17	19
Offline Goal-Conditioned Reinforcement Learning	humanoidmaze large-navigate v0	Success Rate60	14
Goal-conditioned manipulation	OGBench scene-play	Success Rate90	12
Goal-conditioned manipulation	OGBench puzzle-4x5-play	Success Rate17	12
Goal-conditioned manipulation	OGBench puzzle 3x3-play	Success Rate94	12
Goal-conditioned manipulation	OGBench puzzle-4x6-play	Success Rate12	12
Goal-conditioned manipulation	OGBench cube single-play	Success Rate86	12
Offline Goal-Conditioned Reinforcement Learning	OGBench humanoidmaze-medium-navigate	Success Rate90	7
Offline Goal-Conditioned Reinforcement Learning	OGBench antmaze-giant-navigate	Success Rate54	7

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord