Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models

About

Text-to-image generation models that generate images based on prompt descriptions have attracted an increasing amount of attention during the past few months. Despite their encouraging performance, these models raise concerns about the misuse of their generated fake images. To tackle this problem, we pioneer a systematic study on the detection and attribution of fake images generated by text-to-image generation models. Concretely, we first build a machine learning classifier to detect the fake images generated by various text-to-image generation models. We then attribute these fake images to their source models, such that model owners can be held responsible for their models' misuse. We further investigate how prompts that generate fake images affect detection and attribution. We conduct extensive experiments on four popular text-to-image generation models, including DALL$\cdot$E 2, Stable Diffusion, GLIDE, and Latent Diffusion, and two benchmark prompt-image datasets. Empirical results show that (1) fake images generated by various models can be distinguished from real ones, as there exists a common artifact shared by fake images from different models; (2) fake images can be effectively attributed to their source models, as different models leave unique fingerprints in their generated images; (3) prompts with the ``person'' topic or a length between 25 and 75 enable models to generate fake images with higher authenticity. All findings contribute to the community's insight into the threats caused by text-to-image generation models. We appeal to the community's consideration of the counterpart solutions, like ours, against the rapidly-evolving fake image generation.

Zeyang Sha, Zheng Li, Ning Yu, Yang Zhang• 2022

Related benchmarks

TaskDatasetResultRank
AI-generated image detectionGenImage
Midjourney Detection Rate97.13
154
Synthetic Image DetectionForenSynths (test)
Mean Accuracy57.24
60
Deepfake AttributionDF40 and FFHQ unseen generators
SimSwap Accuracy2.32
54
AI-generated image detectionGenImage 61 (test)
AUC81.09
45
AttributionWildDeepfake
Accuracy28.76
34
AI-generated image detectionGenImage 1.0 (test)
Midjourney Detection Rate79.88
24
Deepfake AttributionFLUX unseen
Accuracy28.32
20
Deepfake AttributionDiffusionAct unseen
Accuracy23.84
20
AI-generated image detectionDRCT-2M v1.4 (test)
LDM Detection Rate92.1
20
Deepfake AttributionEDTalk unseen
Accuracy12.16
20
Showing 10 of 45 rows

Other info

Follow for update