Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Nonnegative Matrix Factorization through Cone Collapse

About

Nonnegative matrix factorization (NMF) is a widely used tool for learning parts-based, low-dimensional representations of nonnegative data, with applications in vision, text, and bioinformatics. In clustering applications, orthogonal NMF (ONMF) variants further impose (approximate) orthogonality on the representation matrix so that its rows behave like soft cluster indicators. Existing algorithms, however, are typically derived from optimization viewpoints and do not explicitly exploit the conic geometry induced by NMF: data points lie in a convex cone whose extreme rays encode fundamental directions or "topics". In this work we revisit NMF from this geometric perspective and propose Cone Collapse, an algorithm that starts from the full nonnegative orthant and iteratively shrinks it toward the minimal cone generated by the data. We prove that, under mild assumptions on the data, Cone Collapse terminates in finitely many steps and recovers the minimal generating cone of $\mathbf{X}^\top$ . Building on this basis, we then derive a cone-aware orthogonal NMF model (CC-NMF) by applying uni-orthogonal NMF to the recovered extreme rays. Across 16 benchmark gene-expression, text, and image datasets, CC-NMF consistently matches or outperforms strong NMF baselines-including multiplicative updates, ANLS, projective NMF, ONMF, and sparse NMF-in terms of clustering purity. These results demonstrate that explicitly recovering the data cone can yield both theoretically grounded and empirically strong NMF-based clustering methods.

Manh Nguyen, Daniel Pimentel-Alarc\'on (2) __INSTITUTION_2__ Department of Statistics, (2) Department of Biostatistics, Medical Informatics, Wisconsin Institute of Discovery, University of Wisconsin-Madison)• 2025

Related benchmarks

TaskDatasetResultRank
ClusteringCancer--
15
ClusteringUMist--
13
ClusteringORL--
11
ClusteringAMLALL
Purity98
6
ClusteringKHAN
Purity63
6
ClusteringROSETTA
Purity82
6
ClusteringMed
Purity0.59
6
ClusteringCiteseer
Purity35
6
ClusteringWEBKB4
Purity0.43
6
ClusteringCOIL-20
Purity74
6
Showing 10 of 16 rows

Other info

Follow for update