Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Robust Spectral Watermark for Synthetic Tabular Data

About

The rise of generative AI has enabled the production of high-fidelity synthetic tabular data across fields such as healthcare, finance, and public policy, raising growing concerns about data provenance and misuse. Watermarking offers a promising solution to address these concerns by ensuring the traceability of synthetic data, but existing methods face many limitations: they are computationally expensive due to reliance on the inverse process of large diffusion models, struggle with mixed discrete-continuous data, or lack robustness to common post-processing attacks. To address these limitations, we propose TAB-DRW, an efficient and robust post-editing watermarking scheme for synthetic tabular data. TAB-DRW embeds watermark signals in the frequency domain: it normalizes heterogeneous features via the Yeo-Johnson transformation and standardization, applies the discrete Fourier transform (DFT), and adjusts the imaginary parts of adaptively selected entries according to precomputed pseudorandom bits. To further enhance robustness and efficiency, we introduce a novel rank-based pseudorandom bit generation method that enables row-wise retrieval without incurring storage overhead. Experiments on five benchmark tabular datasets show that TAB-DRW achieves strong detectability and robustness against post-processing and adaptive attacks, while preserving high data fidelity and fully supporting mixed-type features.

Yizhou Zhao, Xiang Li, Peter Song, Qi Long, Weijie Su• 2025

Related benchmarks

TaskDatasetResultRank
Tabular Data Watermarkingmagic
Density91.7
11
Tabular Data WatermarkingDEFAULT
Density92.9
11
Tabular Data WatermarkingDrybean
Density0.931
11
Tabular Data WatermarkingAdult
Density91.5
11
Tabular Data WatermarkingShoppers
Density0.909
11
Watermark robustness against attacksAdult
Error Rate (Row Del. 20%)26.34
10
Watermark robustness against attacksDEFAULT
Row Del. 20% Robustness31.92
5
Watermarking RobustnessShoppers
Row Deletion Robustness38.43
5
Watermarking RobustnessDEFAULT
Robustness: Row Deletion33.92
5
Watermark robustness against attacksShoppers
Performance (Row Del. 20%)36.21
5
Showing 10 of 14 rows

Other info

Follow for update