Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition

About

High-fidelity generative models are increasingly needed in privacy-sensitive scenarios, where access to data is severely restricted due to regulatory and copyright constraints. This scarcity hampers model development--ironically, in settings where generative models are most needed to compensate for the lack of data. This creates a self-reinforcing challenge: limited data leads to poor generative models, which in turn fail to mitigate data scarcity. To break this cycle, we propose a reinforcement-guided synthetic data generation framework that adapts general-domain generative priors to privacy-sensitive identity recognition tasks. We first perform a cold-start adaptation to align a pretrained generator with the target domain, establishing semantic relevance and initial fidelity. Building on this foundation, we introduce a multi-objective reward that jointly optimizes semantic consistency, coverage diversity, and expression richness, guiding the generator to produce both realistic and task-effective samples. During downstream training, a dynamic sample selection mechanism further prioritizes high-utility synthetic samples, enabling adaptive data scaling and improved domain alignment. Extensive experiments on benchmark datasets demonstrate that our framework significantly improves both generation fidelity and classification accuracy, while also exhibiting strong generalization to novel categories in small-data regimes.

Xuemei Jia, Jiawei Du, Hui Wei, Jun Chen, Joey Tianyi Zhou, Zheng Wang• 2026

Related benchmarks

TaskDatasetResultRank
Person Re-IdentificationMarket-1501 (test)
Rank-194.9
397
Face VerificationLFW
Mean Accuracy93.6
347
Face VerificationCFP-FP
Accuracy73.26
135
Person Re-IdentificationCUHK03 NP (new protocol) (test)
mAP76.6
106
Face VerificationCA-LFW
Accuracy81.68
98
Face VerificationAgeDB
Accuracy76.8
63
Face VerificationCP-LWF
Verification Accuracy70.02
8
Showing 7 of 7 rows

Other info

Follow for update