Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FBA$^2$D: Frequency-based Black-box Attack for AI-generated Image Detection

About

The prosperous development of Artificial Intelligence-Generated Content (AIGC) has brought people's anxiety about the spread of false information on social media. Designing detectors for filtering is an effective defense method, but most detectors will be compromised by adversarial samples. Currently, most studies exposing AIGC security issues assume information on model structure and data distribution. In real applications, attackers query and interfere with models that provide services in the form of application programming interfaces (APIs), which constitutes the black-box decision-based attack paradigm. However, to the best of our knowledge, decision-based attacks on AIGC detectors remain unexplored. In this study, we propose \textbf{FBA$^2$D}: a frequency-based black-box attack method for AIGC detection to fill the research gap. Motivated by frequency-domain discrepancies between generated and real images, we develop a decision-based attack that leverages the Discrete Cosine Transform (DCT) for fine-grained spectral partitioning and selects frequency bands as query subspaces, improving both query efficiency and image quality. Moreover, attacks on AIGC detectors should mitigate initialization failures, preserve image quality, and operate under strict query budgets. To address these issues, we adopt an ``adversarial example soup'' method, averaging candidates from successive surrogate iterations and using the result as the initialization to accelerate the query-based attack. The empirical study on the Synthetic LSUN dataset and GenImage dataset demonstrate the effectiveness of our prosed method. This study shows the urgency of addressing practical AIGC security problems.

Xiaojing Chen, Dan Li, Lijun Peng, Jun Yan{\L}etter, Zhiqing Guo, Junyang Chen, Xiao Lan, Zhongjie Ba, Yunfeng Diao{\L}etter• 2025

Related benchmarks

TaskDatasetResultRank
Black-box AttackLSUN
ASR99.9
189
Black-box AttackGenImage
ASR99.9
162
Adversarial Attack ImperceptibilityImageNet
PSNR36.8
30
Adversarial Attack ImperceptibilityGenImage
PSNR34.7
30
Showing 4 of 4 rows

Other info

Follow for update