Disrupting Deepfakes: Adversarial Attacks Against Conditional Image Translation Networks and Facial Manipulation Systems

About

Face modification systems using deep learning have become increasingly powerful and accessible. Given images of a person's face, such systems can generate new images of that same person under different expressions and poses. Some systems can also modify targeted attributes such as hair color or age. This type of manipulated images and video have been coined Deepfakes. In order to prevent a malicious user from generating modified images of a person without their consent we tackle the new problem of generating adversarial attacks against such image translation systems, which disrupt the resulting output image. We call this problem disrupting deepfakes. Most image translation architectures are generative models conditioned on an attribute (e.g. put a smile on this person's face). We are first to propose and successfully apply (1) class transferable adversarial attacks that generalize to different classes, which means that the attacker does not need to have knowledge about the conditioning class, and (2) adversarial training for generative adversarial networks (GANs) as a first step towards robust image translation networks. Finally, in gray-box scenarios, blurring can mount a successful defense against disruption. We present a spread-spectrum adversarial attack, which evades blur defenses. Our open-source code can be found at https://github.com/natanielruiz/disrupting-deepfakes.

Nataniel Ruiz, Sarah Adel Bargal, Stan Sclaroff• 2020

Related benchmarks

Task	Dataset	Result
Face Manipulation Adversarial Attack	CelebA	PSNR35.04	28
Face Manipulation Adversarial Attack	FFHQ	PSNR34.729	28
Face Manipulation Adversarial Attack	LFW	PSNR35.417	28
Face Swapping Attack	MegaGAN	ASR37.5	20
Face Swapping Attack	DiffSwap	ASR79.4	20
Adversarial Attack on Face Swapping	SimSwap	ASR9.6	10
Face Swapping Attack	DiffFace	ASR40.2	10
Face Swapping Attack	FaceShifter	ASR15.7	10
Input Image Reconstruction	Deepfake Reconstruction Dataset	MSE24.59	8
Output Image Reconstruction	Deepfake Reconstruction Dataset	MSE43.64	8

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord