Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Disrupting Deepfakes: Adversarial Attacks Against Conditional Image Translation Networks and Facial Manipulation Systems

About

Face modification systems using deep learning have become increasingly powerful and accessible. Given images of a person's face, such systems can generate new images of that same person under different expressions and poses. Some systems can also modify targeted attributes such as hair color or age. This type of manipulated images and video have been coined Deepfakes. In order to prevent a malicious user from generating modified images of a person without their consent we tackle the new problem of generating adversarial attacks against such image translation systems, which disrupt the resulting output image. We call this problem disrupting deepfakes. Most image translation architectures are generative models conditioned on an attribute (e.g. put a smile on this person's face). We are first to propose and successfully apply (1) class transferable adversarial attacks that generalize to different classes, which means that the attacker does not need to have knowledge about the conditioning class, and (2) adversarial training for generative adversarial networks (GANs) as a first step towards robust image translation networks. Finally, in gray-box scenarios, blurring can mount a successful defense against disruption. We present a spread-spectrum adversarial attack, which evades blur defenses. Our open-source code can be found at https://github.com/natanielruiz/disrupting-deepfakes.

Nataniel Ruiz, Sarah Adel Bargal, Stan Sclaroff• 2020

Related benchmarks

TaskDatasetResultRank
Face Manipulation Adversarial AttackCelebA
PSNR35.04
28
Face Manipulation Adversarial AttackFFHQ
PSNR34.729
28
Face Manipulation Adversarial AttackLFW
PSNR35.417
28
Input Image ReconstructionDeepfake Reconstruction Dataset
MSE24.59
8
Output Image ReconstructionDeepfake Reconstruction Dataset
MSE43.64
8
Facial Attribute Editing Defense (StarGAN)FFHQ
L2 Distance0.2501
7
Face Swapping DefenseSimSwap (test)
AUC (JPEG Compression)91.7
7
Facial Attribute Editing Defense (HiSD)CelebA
L2 Distance0.0476
7
Facial Attribute Editing Defense (HiSD)FFHQ
L2 Distance0.0646
7
Face Attribute Editing DefenseStarGAN (test)
AUC (JPEG Compression)0.419
7
Showing 10 of 21 rows

Other info

Follow for update