Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Transport Clustering: Solving Low-Rank Optimal Transport via Clustering

About

Optimal transport (OT) finds a least cost transport plan between two probability distributions using a cost matrix defined on pairs of points. Unlike standard OT, which infers unstructured pointwise mappings, low-rank optimal transport explicitly constrains the rank of the transport plan to infer latent structure. This improves statistical stability and robustness, yields sharper parametric rates for estimating Wasserstein distances adaptive to the intrinsic rank, and generalizes $K$-means to co-clustering. These advantages, however, come at the cost of a non-convex and NP-hard optimization problem. We introduce transport clustering, an algorithm to compute a low-rank OT plan that reduces low-rank OT to a clustering problem on correspondences obtained from a full-rank $\textit{transport registration}$ step. We prove that this reduction yields polynomial-time, constant-factor approximation algorithms for low-rank OT: specifically, a $(1+\gamma)$ approximation for negative-type metrics and a $(1+\gamma+\sqrt{2\gamma}\,)$ approximation for kernel costs, where $\gamma \in [0,1]$ denotes the approximation ratio of the optimal full-rank solution relative to the low-rank optimal. Empirically, transport clustering outperforms existing low-rank OT solvers on synthetic benchmarks and large-scale, high-dimensional datasets.

Henri Schmidt, Peter Halmos, Ben Raphael• 2026

Related benchmarks

TaskDatasetResultRank
Wasserstein Distance EstimationFragmented hypercube
Absolute Error (W₂²)0.242
40
Low-rank Optimal TransportCIFAR-10
OT Cost231.2
3
Low-rank Optimal TransportMouse embryo E8.5 -> E8.75
OT Cost0.506
3
Single-cell transcriptomics alignmentSingle-cell mouse embryo transcriptomics E8.5 → E8.75
OT Cost0.506
3
Single-cell transcriptomics alignmentSingle-cell mouse embryo transcriptomics E8.75 → E9.0
OT Cost0.384
3
Low-rank Optimal TransportMouse embryo E9.5 -> E9.75
OT Cost0.389
2
Single-cell transcriptomics alignmentSingle-cell mouse embryo transcriptomics E9.0 → E9.25
OT Cost0.452
2
Single-cell transcriptomics alignmentSingle-cell mouse embryo transcriptomics E9.25 → E9.5
OT Cost0.411
2
Single-cell transcriptomics alignmentSingle-cell mouse embryo transcriptomics E9.5 → E9.75
OT Cost0.389
2
Single-cell transcriptomics alignmentSingle-cell mouse embryo transcriptomics E9.75 → E10.0
OT Cost0.361
2
Showing 10 of 10 rows

Other info

Follow for update