
UTOPIA: Unlearnable Tabular Data via Decoupled Shortcut Embedding

About

Unlearnable examples (UE) have emerged as a practical mechanism to prevent unauthorized model training on private vision data, but extending this protection to tabular data is nontrivial. Tabular data in finance and healthcare is highly sensitive, yet existing UE methods transfer poorly because tabular features mix numerical and categorical constraints and exhibit saliency sparsity, with learning dominated by a few dimensions. Under a Spectral Dominance condition, we show that certified unlearnability is feasible when the poison spectrum overwhelms the clean semantic spectrum. Guided by this, we propose Unlearnable Tabular Data via DecOuPled Shortcut EmbeddIng (UTOPIA), which exploits feature redundancy to decouple optimization into two channels: high-saliency features for semantic obfuscation, and low-saliency redundant features for embedding a hyper-correlated shortcut. This yields constraint-aware dominant shortcuts while preserving tabular validity. Extensive experiments across tabular datasets and models show that UTOPIA drives unauthorized training toward near-random performance, outperforming strong UE baselines and transferring well across architectures.

Jiaming He, Fuming Luo, Hongwei Li, Wenbo Jiang, Wenshu Fan, Zhenbo Shi, Xudong Jiang, Yi Yu • 2026
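
The two-channel idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the saliency proxy (absolute feature-label correlation), the Gaussian obfuscation noise, the random per-class sign pattern used as a shortcut, and every name below (`split_by_saliency`, `toy_decoupled_perturb`, `k`, `noise`, `shift`) are assumptions chosen for illustration. It handles numerical features and binary labels only, and keeps values inside each column's observed range as a crude stand-in for tabular validity.

```python
import numpy as np

def split_by_saliency(X, y, k):
    """Rank features by |corr(feature, label)|, an assumed saliency proxy."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum()) + 1e-12
    saliency = np.abs(Xc.T @ yc) / denom
    order = np.argsort(-saliency)
    return order[:k], order[k:]  # (high-saliency, low-saliency) indices

def toy_decoupled_perturb(X, y, k=2, noise=0.5, shift=1.0, seed=0):
    """Toy two-channel perturbation (hypothetical, not the paper's method)."""
    rng = np.random.default_rng(seed)
    lo, hi = X.min(axis=0), X.max(axis=0)
    high_idx, low_idx = split_by_saliency(X, y, k)
    Xp = X.astype(float).copy()

    # Channel 1: blur the high-saliency features with scaled Gaussian noise.
    scale = X[:, high_idx].std(axis=0)
    Xp[:, high_idx] += noise * scale * rng.standard_normal((X.shape[0], len(high_idx)))

    # Channel 2: add a class-dependent sign pattern to low-saliency features,
    # so the label becomes predictable from otherwise uninformative columns.
    classes = np.unique(y)
    patterns = rng.choice([-1.0, 1.0], size=(len(classes), len(low_idx)))
    class_pos = np.searchsorted(classes, y)
    Xp[:, low_idx] += shift * X[:, low_idx].std(axis=0) * patterns[class_pos]

    # Keep every column inside its observed range.
    return np.clip(Xp, lo, hi), high_idx, low_idx

# Synthetic demo: feature 0 carries the label signal, the rest are noise.
rng = np.random.default_rng(1)
y = rng.integers(0, 2, size=200)
X = rng.standard_normal((200, 6))
X[:, 0] += 3.0 * y
X_protected, high_idx, low_idx = toy_decoupled_perturb(X, y, k=1)
```

In the demo, the informative column lands in the high-saliency channel and the five remaining columns receive the shortcut pattern. Categorical constraints and the Spectral Dominance analysis from the paper are outside the scope of this sketch.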

Related benchmarks

Task                         Dataset            Result           Rank
Binary Classification        CH (test)          Accuracy 61.02   64
Classification               dry-bean (test)    Accuracy 25.36   39
Binary Classification        KC1 (test)         Accuracy 46.54   32
Binary Classification        Employee (test)    Accuracy 49.26   32
Multi-class classification   JV (test)          Accuracy 31.05   32
Multi-class classification   IF (test)          Accuracy 42.39   32
Multi-class classification   EOL (test)         Accuracy 18.34   32
