Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings

About

Unsupervised domain adaptation (UDA) involves learning class semantics from labeled data within a source domain that generalize to an unseen target domain. UDA methods are particularly impactful for semantic segmentation, where annotations are more difficult to collect than in image classification. Despite recent advances in large-scale vision-language representation learning, UDA methods for segmentation have not taken advantage of the domain-agnostic properties of text. To address this, we present a novel Covariance-based Pixel-Text loss, CoPT, that uses domain-agnostic text embeddings to learn domain-invariant features in an image segmentation encoder. The text embeddings are generated through our LLM Domain Template process, where an LLM is used to generate source and target domain descriptions that are fed to a frozen CLIP model and combined. In experiments on four benchmarks we show that a model trained using CoPT achieves the new state of the art performance on UDA for segmentation. The code can be found at https://github.com/cfmata/CoPT.

Cristina Mata, Kanchana Ranasinghe, Michael S. Ryoo• 2025

Related benchmarks

TaskDatasetResultRank
Semantic segmentationGTA5 → Cityscapes (val)
mIoU76.1
533
Semantic segmentationCityscapes trained on SYNTHIA (val)
Road IoU83.4
60
Showing 2 of 2 rows

Other info

Follow for update