Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bridging the Micro--Macro Gap: Frequency-Aware Semantic Alignment for Image Manipulation Localization

About

As generative image editing advances, image manipulation localization (IML) must handle both traditional manipulations with conspicuous forensic artifacts and diffusion-generated edits that appear locally realistic. Existing methods typically rely on either low-level forensic cues or high-level semantics alone, leading to a fundamental micro--macro gap. To bridge this gap, we propose FASA, a unified framework for localizing both traditional and diffusion-generated manipulations. Specifically, we extract manipulation-sensitive frequency cues through an adaptive dual-band DCT module and learn manipulation-aware semantic priors via patch-level contrastive alignment on frozen CLIP representations. We then inject these priors into a hierarchical frequency pathway through a semantic-frequency side adapter for multi-scale feature interaction, and employ a prototype-guided, frequency-gated mask decoder to integrate semantic consistency with boundary-aware localization for tampered region prediction. Extensive experiments on OpenSDI and multiple traditional manipulation benchmarks demonstrate state-of-the-art localization performance, strong cross-generator and cross-dataset generalization, and robust performance under common image degradations.

Xiaojie Liang, Zhimin Chen, Ziqi Sheng, Wei Lu• 2026

Related benchmarks

TaskDatasetResultRank
Image Manipulation LocalizationNIST16
F1 Score40.92
75
Image Manipulation LocalizationCoverage
F1 Score65.37
49
Pixel-level Forgery LocalizationColumbia
F190.57
20
Image-level detectionOpenSDI
SD1.5 F1 Score93.75
15
Image Manipulation LocalizationOpenSDI SD1.5
F1 Score81.19
9
Image Manipulation LocalizationOpenSDI SD2.1
F1 Score72.71
9
Image Manipulation LocalizationOpenSDI SDXL
F1 Score49.14
9
Image Manipulation LocalizationOpenSDI SD3
F1 Score61.74
9
Image Manipulation LocalizationOpenSDI Flux.1
F1 Score24.37
9
Image Manipulation LocalizationOpenSDI Average
F1 Score57.83
9
Showing 10 of 11 rows

Other info

Follow for update