Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SOMtime the World Ain$'$t Fair: Violating Fairness Using Self-Organizing Maps

About

Unsupervised representations are widely assumed to be neutral with respect to sensitive attributes when those attributes are withheld from training. We show that this assumption is false. Using SOMtime, a topology-preserving representation method based on high-capacity Self-Organizing Maps, we demonstrate that sensitive attributes such as age and income emerge as dominant latent axes in purely unsupervised embeddings, even when explicitly excluded from the input. On two large-scale real-world datasets (the World Values Survey across five countries and the Census-Income dataset), SOMtime recovers monotonic orderings aligned with withheld sensitive attributes, achieving Spearman correlations of up to 0.85, whereas PCA and UMAP typically remain below 0.23 (with a single exception reaching 0.31), and against t-SNE and autoencoders which achieve at most 0.34. Furthermore, unsupervised segmentation of SOMtime embeddings produces demographically skewed clusters, demonstrating downstream fairness risks without any supervised task. These findings establish that \textit{fairness through unawareness} fails at the representation level for ordinal sensitive attributes and that fairness auditing must extend to unsupervised components of machine learning pipelines. We have made the code available at~ https://github.com/JosephBingham/SOMtime

Joseph Bingham, Netanel Arussy, Dvir Aran• 2026

Related benchmarks

TaskDatasetResultRank
Sensitive attribute leakageWVS China 7
Pearson Correlation0.8
5
Sensitive attribute leakageWVS USA 7
Pearson Correlation0.8
5
Sensitive attribute leakageCensus KDD Age 1.0
Pearson Correlation0.5
5
Sensitive attribute leakageCensus (KDD) Income 1.0
Pearson Correlation0.48
5
Sensitive attribute leakageCensus (KDD) Capital Gains 1.0
Pearson Correlation (r)0.34
5
Sensitive attribute leakageWVS 7 (Canada)
Pearson Correlation0.75
5
Sensitive attribute leakageWVS Romania 7
Pearson Corr0.66
5
Sensitive attribute leakageWVS Germany 7
Pearson Correlation0.79
5
Showing 8 of 8 rows

Other info

Follow for update