Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PASS: Ambiguity Guided Subsets for Scalable Classical and Quantum Constrained Clustering

About

Pairwise-constrained clustering augments unsupervised partitioning with side information by enforcing must-link (ML) and cannot-link (CL) constraints between specific samples, yielding labelings that respect known affinities and separations. However, ML and CL constraints add an extra layer of complexity to the clustering problem, with current methods struggling in data scalability, especially in niche applications like quantum or quantum-hybrid clustering. We propose PASS, a pairwise-constraints and ambiguity-driven subset selection framework that preserves ML and CL constraints satisfaction while allowing scalable, high-quality clustering solution. PASS collapses ML constraints into pseudo-points and offers two selectors: a constraint-aware margin rule that collects near-boundary points and all detected CL violations, and an information-geometric rule that scores points via a Fisher-Rao distance derived from soft assignment posteriors, then selects the highest-information subset under a simple budget. Across diverse benchmarks, PASS attains competitive SSE at substantially lower cost than exact or penalty-based methods, and remains effective in regimes where prior approaches fail.

Pedro Chumpitaz-Flores, My Duong, Ying Mao, Kaixun Hua• 2026

Related benchmarks

TaskDatasetResultRank
ClusteringIris
SSE95.87
24
ClusteringSEEDS
SSE673.4
22
Pairwise-constrained clusteringSeeds UCI (full)
SSE600.2
15
ClusteringAn blobs (n=500, m=2, k=3)
SSE153.4
15
Pairwise-constrained clusteringIris UCI (full)
SSE83.72
15
ClusteringRaisins
SSE1.28e+12
14
Pairwise-constrained clusteringHemi
SSE1.40e+7
14
ClusteringLand mine (n=338, d=3, k=5)
SSE26.96
13
Pairwise-constrained clusteringPR2392
SSE2.58e+10
13
Pairwise-constrained clusteringRDS CNT
SSE2.95e+7
13
Showing 10 of 37 rows

Other info

Follow for update