SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization

About

We present a novel framework, Spatial Pyramid Attention Network (SPAN) for detection and localization of multiple types of image manipulations. The proposed architecture efficiently and effectively models the relationship between image patches at multiple scales by constructing a pyramid of local self-attention blocks. The design includes a novel position projection to encode the spatial positions of the patches. SPAN is trained on a generic, synthetic dataset but can also be fine tuned for specific datasets; The proposed method shows significant gains in performance on standard datasets over previous state-of-the-art methods.

Xuefeng Hu, Zhihan Zhang, Zhenye Jiang, Syomantak Chaudhuri, Zhenheng Yang, Ram Nevatia• 2020

Related benchmarks

Task	Dataset	Result
Image Manipulation Localization	NIST16	F1 Score83.59	93
Image Manipulation Localization	Coverage	F1 Score55.8	78
Image Manipulation Localization	Columbia	--	60
Image Forgery Detection	DSO-1	AUC66.9	41
Tamper Localization	Columbia	IoU14	36
Image Forgery Localization	DSO-1	F1 Score0.059	35
Pixel-level Manipulation Detection	Columbia	F1 Score77.4	34
Pixel-level Manipulation Detection	NIST	F1 Score68.3	34
Pixel-level Manipulation Detection	COVER	F1 Score71.8	34
Pixel-level Manipulation Detection	DEFACTO 12k	F1 Score57.1	32

Showing 10 of 71 rows

...

Other info

Follow for update

@wizwand_team Discord