Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ObjectFormer for Image Manipulation Detection and Localization

About

Recent advances in image editing techniques have posed serious challenges to the trustworthiness of multimedia data, which drives the research of image tampering detection. In this paper, we propose ObjectFormer to detect and localize image manipulations. To capture subtle manipulation traces that are no longer visible in the RGB domain, we extract high-frequency features of the images and combine them with RGB features as multimodal patch embeddings. Additionally, we use a set of learnable object prototypes as mid-level representations to model the object-level consistencies among different regions, which are further used to refine patch embeddings to capture the patch-level consistencies. We conduct extensive experiments on various datasets and the results verify the effectiveness of the proposed method, outperforming state-of-the-art tampering detection and localization methods.

Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang• 2022

Related benchmarks

TaskDatasetResultRank
Image Manipulation LocalizationNIST16
F1 Score87.2
42
Image Manipulation LocalizationNIST 16
AUC0.872
31
Image Manipulation LocalizationCoverage
AUC95.7
16
Image Manipulation LocalizationCOVERAGE (test)
F1 Score75.8
14
Image-level detectionOpenSDID Flux.1 (cross-domain)
F1 Score37.92
14
Image-level detectionOpenSDID cross-domain SD2.1
F1 Score66.79
14
Image-level detectionOpenSDID SDXL (cross-domain)
F1 Score49.19
14
Image-level detectionOpenSDID SD3 cross-domain
F1 Score48.32
14
Image-level detectionOpenSDID Average (aggregated)
F1 Score54.79
14
Image-level detectionOpenSDID SD1.5 (in-domain)
F1 Score71.72
14
Showing 10 of 33 rows

Other info

Follow for update