Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization

About

We present a novel framework, Spatial Pyramid Attention Network (SPAN) for detection and localization of multiple types of image manipulations. The proposed architecture efficiently and effectively models the relationship between image patches at multiple scales by constructing a pyramid of local self-attention blocks. The design includes a novel position projection to encode the spatial positions of the patches. SPAN is trained on a generic, synthetic dataset but can also be fine tuned for specific datasets; The proposed method shows significant gains in performance on standard datasets over previous state-of-the-art methods.

Xuefeng Hu, Zhihan Zhang, Zhenye Jiang, Syomantak Chaudhuri, Zhenheng Yang, Ram Nevatia• 2020

Related benchmarks

TaskDatasetResultRank
Image Manipulation LocalizationNIST16
F1 Score83.59
42
Pixel-level Manipulation DetectionColumbia
F1 Score77.4
34
Pixel-level Manipulation DetectionNIST
F1 Score68.3
34
Pixel-level Manipulation DetectionCOVER
F1 Score71.8
34
Pixel-level Manipulation DetectionDEFACTO 12k
F1 Score57.1
32
Image Manipulation LocalizationNIST 16
AUC0.84
31
Image Forgery DetectionColumbia
AUC0.999
25
Image Forgery DetectionCoverage
AUC0.67
25
Image Forgery DetectionDSO-1
AUC66.9
25
Pixel-level Manipulation DetectionCASIA v1+
F1 Score68.8
22
Showing 10 of 65 rows

Other info

Follow for update