Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dual Frequency Branch Framework with Reconstructed Sliding Windows Attention for AI-Generated Image Detection

About

The rapid advancement of Generative Adversarial Networks (GANs) and diffusion models has enabled the creation of highly realistic synthetic images, presenting significant societal risks, such as misinformation and deception. As a result, detecting AI-generated images has emerged as a critical challenge. Existing researches emphasize extracting fine-grained features to enhance detector generalization, yet they often lack consideration for the importance and interdependencies of internal elements within local regions and are limited to a single frequency domain, hindering the capture of general forgery traces. To overcome the aforementioned limitations, we first utilize a sliding window to restrict the attention mechanism to a local window, and reconstruct the features within the window to model the relationships between neighboring internal elements within the local region. Then, we design a dual frequency domain branch framework consisting of four frequency domain subbands of DWT and the phase part of FFT to enrich the extraction of local forgery features from different perspectives. Through feature enrichment of dual frequency domain branches and fine-grained feature extraction of reconstruction sliding window attention, our method achieves superior generalization detection capabilities on both GAN and diffusion model-based generative images. Evaluated on diverse datasets comprising images from 65 distinct generative models, our approach achieves a 2.13\% improvement in detection accuracy over state-of-the-art methods.

Jiazhen Yan, Ziqiang Li, Fan Wang, Ziwen He, Zhangjie Fu• 2025

Related benchmarks

TaskDatasetResultRank
Synthetic Image DetectionGlide 50-27
Accuracy98
27
Fake Image DetectionUniversalFakeDetect Guided
Accuracy95.2
13
Fake Image DetectionUniversalFakeDetect LDM_100
Accuracy99.4
13
Fake Image DetectionUniversalFakeDetect LDM_200
Accuracy99.5
13
Fake Image DetectionUniversalFakeDetect LDM_200_cfg
Accuracy98.9
13
Fake Image DetectionUniversalFakeDetect Mean
Accuracy96.4
13
Generated Image DetectionDiffusionForensics Cross-model (test)
DDPM Accuracy99.9
13
Fake Image DetectionUniversalFakeDetect Glide_100_10
Accuracy97.7
13
Fake Image DetectionUniversalFakeDetect Glide_100_27
Accuracy97.2
13
GAN-generated image detectionGANGen-Detection
BEGAN Accuracy98.6
13
Showing 10 of 37 rows

Other info

Follow for update