Image Manipulation Detection by Multi-View Multi-Scale Supervision
About
The key challenge of image manipulation detection is how to learn generalizable features that are sensitive to manipulations in novel data, whilst specific to prevent false alarms on authentic images. Current research emphasizes the sensitivity, with the specificity overlooked. In this paper we address both aspects by multi-view feature learning and multi-scale supervision. By exploiting noise distribution and boundary artifact surrounding tampered regions, the former aims to learn semantic-agnostic and thus more generalizable features. The latter allows us to learn from authentic images which are nontrivial to be taken into account by current semantic segmentation network based methods. Our thoughts are realized by a new network which we term MVSS-Net. Extensive experiments on five benchmark sets justify the viability of MVSS-Net for both pixel-level and image-level manipulation detection.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Pixel-level Manipulation Detection | NIST | F1 Score73.7 | 34 | |
| Pixel-level Manipulation Detection | COVER | F1 Score82.4 | 34 | |
| Pixel-level Manipulation Detection | Columbia | F1 Score70.3 | 34 | |
| Pixel-level Manipulation Detection | DEFACTO 12k | F1 Score57.2 | 32 | |
| Image Forgery Detection | Columbia | AUC0.984 | 25 | |
| Image Forgery Detection | Coverage | AUC0.733 | 25 | |
| Image Forgery Detection | DSO-1 | AUC55.2 | 25 | |
| Image Forgery Detection | ForensicHub IFF-Protocol v2025 (test) | FF-c400.713 | 23 | |
| Image-level manipulation detection | CASIA v1+ | AUC0.932 | 19 | |
| Image Manipulation Localization | CocoGlide (test) | F1 Score48.6 | 18 |