SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization
About
We present a novel framework, Spatial Pyramid Attention Network (SPAN) for detection and localization of multiple types of image manipulations. The proposed architecture efficiently and effectively models the relationship between image patches at multiple scales by constructing a pyramid of local self-attention blocks. The design includes a novel position projection to encode the spatial positions of the patches. SPAN is trained on a generic, synthetic dataset but can also be fine tuned for specific datasets; The proposed method shows significant gains in performance on standard datasets over previous state-of-the-art methods.
Xuefeng Hu, Zhihan Zhang, Zhenye Jiang, Syomantak Chaudhuri, Zhenheng Yang, Ram Nevatia• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Manipulation Localization | NIST16 | F1 Score83.59 | 42 | |
| Pixel-level Manipulation Detection | Columbia | F1 Score77.4 | 34 | |
| Pixel-level Manipulation Detection | NIST | F1 Score68.3 | 34 | |
| Pixel-level Manipulation Detection | COVER | F1 Score71.8 | 34 | |
| Pixel-level Manipulation Detection | DEFACTO 12k | F1 Score57.1 | 32 | |
| Image Manipulation Localization | NIST 16 | AUC0.84 | 31 | |
| Image Forgery Detection | Columbia | AUC0.999 | 25 | |
| Image Forgery Detection | Coverage | AUC0.67 | 25 | |
| Image Forgery Detection | DSO-1 | AUC66.9 | 25 | |
| Pixel-level Manipulation Detection | CASIA v1+ | F1 Score68.8 | 22 |
Showing 10 of 65 rows