Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

COLLIE: Guiding Skill Discovery in Semantically Coherent Latent Space

About

Unsupervised skill discovery (USD) aims to learn diverse behaviors without reward functions, but often results in task-irrelevant or hazardous behaviors due to uniform exploration. Guided skill discovery (GSD) addresses this issue by incorporating human intent to focus exploration on meaningful regions. However, existing GSD methods typically require training additional guidance models, and rely on pre-defined rules or expert demonstration, which can be ineffective under sparse, online-collected human feedback. To overcome this, we propose COLLIE, a GSD framework that leverages dense unsupervised data to construct a semantically coherent skill latent space. This latent space is well-structured, enabling reliable guidance with sparse online feedback. Moreover, its semantic coherence property enables training-free construction of guidance signals, eliminating the need for additional model training beyond skill learning. Theoretical analysis justifies the effectiveness of our training-free guidance signal, while experiments across diverse state-based and pixel-based tasks show that COLLIE learns diverse, human-aligned skills, avoids hazardous behaviors, and achieves superior downstream performance with minimal human feedback.

Yao Luan, Ni Mu, Hanfei Ge, Yiqin Yang, Bo Xu, Qing-Shan Jia• 2026

Related benchmarks

TaskDatasetResultRank
Safe LocomotionAnt North
Safe State Ratio96.9
7
Safe LocomotionQuadruped North
Safe State Ratio87.6
7
Safe LocomotionSafety-Gym Hazard
Safe State Ratio40
7
Downstream Task PerformanceAnt Hole
Average Performance (Ant Hole)650.7
7
Hierarchical ControlHalfcheetah
Performance Score45.26
7
Safe LocomotionAnt Range
Safe State Ratio80.9
7
Safe LocomotionHalfCheetah Right
Safe State Ratio98.7
7
Safe LocomotionAnt Range-North
Safe State Ratio81.4
7
Safe LocomotionAnt Hole
Safe State Ratio90.6
7
Safe LocomotionAnt Hole-North
Safe State Ratio93.1
7
Showing 10 of 14 rows

Other info

Follow for update