Unsupervised Synthetic Image Attribution: Alignment and Disentanglement
About
As the quality of synthetic images improves, identifying the underlying concepts of model-generated images is becoming increasingly crucial for copyright protection and ensuring model transparency. Existing methods achieve this attribution goal by training models using annotated pairs of synthetic images and their original training sources. However, obtaining such paired supervision is challenging, as it requires either well-designed synthetic concepts or precise annotations from millions of training sources. To eliminate the need for costly paired annotations, in this paper, we explore the possibility of unsupervised synthetic image attribution. We propose a simple yet effective unsupervised method called Alignment and Disentanglement. Specifically, we begin by performing basic concept alignment using contrastive self-supervised learning. Next, we enhance the model's attribution ability by promoting representation disentanglement with the Infomax loss. This approach is motivated by an interesting observation: contrastive self-supervised models, such as MoCo and DINO, inherently exhibit the ability to perform simple cross-domain alignment. By formulating this observation as a theoretical assumption on cross-covariance, we provide a theoretical explanation of how alignment and disentanglement can approximate the concept-matching process through a decomposition of the canonical correlation analysis objective. On the real-world benchmarks, AbC, we show that our unsupervised method surprisingly outperforms the supervised methods. As a starting point, we expect our intuitive insights and experimental findings to provide a fresh perspective on this challenging task.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Synthetic Image Attribution | AbC ImageNet-Seen GPT | R@549.1 | 20 | |
| Synthetic Image Attribution | AbC BAM-FG GPT | R@520.3 | 20 | |
| Synthetic Image Attribution | AbC ImageNet-Unseen GPT | R@584.5 | 20 | |
| Synthetic Image Attribution | AbC ImageNet-Seen Media | R@50.364 | 20 | |
| Synthetic Image Attribution | AbC BAM-FG Object | R@525.1 | 20 | |
| Synthetic Image Attribution | AbC ImageNet-Unseen Media | R@50.633 | 20 | |
| Synthetic Image Attribution | AbC Artchive, GPT | R@522.9 | 20 | |
| Synthetic Image Attribution | AbC Artchive Object | R@517.6 | 20 |