Raising the Cost of Malicious AI-Powered Image Editing

About

We present an approach to mitigating the risks of malicious image editing posed by large diffusion models. The key idea is to immunize images so as to make them resistant to manipulation by these models. This immunization relies on injection of imperceptible adversarial perturbations designed to disrupt the operation of the targeted diffusion models, forcing them to generate unrealistic images. We provide two methods for crafting such perturbations, and then demonstrate their efficacy. Finally, we discuss a policy component necessary to make our approach fully effective and practical -- one that involves the organizations developing diffusion models, rather than individual users, to implement (and support) the immunization process.

Hadi Salman, Alaa Khaddaj, Guillaume Leclerc, Andrew Ilyas, Aleksander Madry• 2023

Related benchmarks

Task	Dataset	Result
Talking Head Generation	HDTF (test)	FID107.2	73
Identity Protection	CelebA (test)	ISM75.8	48
Portrait Privacy Protection	SyncTalk-generated videos (test)	PSNR28.1	45
Identity Protection	VGG-Face (test)	FDR0.624	32
Face Swapping Defense	CelebA-HQ (test)	--	30
Face Swapping Protection	CelebA-HQ	L2 Distance0.0141	28
Face Swapping Protection	VGGFace2 HQ	L2 Distance0.0263	28
Adversarial Face Protection	Face protection evaluation set	Denoising Loss0.1713	27
Image Quality Evaluation	CelebA-HQ	PSNR29.764	25
Image Immunization	InstructPix2Pix Original Prompt	PSNR17.83	16

Showing 10 of 66 rows

Other info

Follow for update

@wizwand_team Discord