Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Propensity Score Alignment of Unpaired Multimodal Data

About

Multimodal representation learning techniques typically rely on paired samples to learn common representations, but paired samples are challenging to collect in fields such as biology where measurement devices often destroy the samples. This paper presents an approach to address the challenge of aligning unpaired samples across disparate modalities in multimodal representation learning. We draw an analogy between potential outcomes in causal inference and potential views in multimodal observations, which allows us to use Rubin's framework to estimate a common space in which to match samples. Our approach assumes we collect samples that are experimentally perturbed by treatments, and uses this to estimate a propensity score from each modality, which encapsulates all shared information between a latent state and treatment and can be used to define a distance between samples. We experiment with two alignment techniques that leverage this distance -- shared nearest neighbours (SNN) and optimal transport (OT) matching -- and find that OT matching results in significant improvements over state-of-the-art alignment approaches in both a synthetic multi-modal setting and in real-world data from NeurIPS Multimodal Single-Cell Integration Challenge.

Johnny Xi, Jana Osea, Zuheng Xu, Jason Hartford• 2024

Related benchmarks

TaskDatasetResultRank
Cross-modal predictionCITE-seq (test)
Median R^20.233
7
Multimodal alignmentCITE-seq (test)
FOSCTTM0.3126
7
Multimodal alignmentSynthetic interventional image dataset (test)
MSE0.0316
6
Cross-modal predictionPerturbSeq (In Distribution)
Wasserstein-1 Distance4.199
3
Cross-modal predictionPerturbSeq Out of Distribution
Wasserstein-1 Distance5.394
3
Cross-modal predictionPerturbSeq Single Cell Image Data In Distribution (test)
Median KL Divergence50.967
3
Cross-modal predictionPerturbSeq Single Cell Image Data (Out of Distribution)
KL Divergence43.554
3
Showing 7 of 7 rows

Other info

Follow for update