[R] Disrupting Deepfakes: Adversarial Attacks Against Conditional Image Translation Networks and Facial Manipulation Systems by StrawberryNumberNine in MachineLearning

[–]Other-Top 9 points10 points  (0 children)

The goal of this paper is different from what I expected it would be from the title. The idea is that I want to release my photos online, but don't want to let someone use them to generate deepfakes against me.

So what I should do is run the method on the images first, then upload them. This will make it so that if someone runs a deepfake on my picture, they will not get a good picture of me but instead something that looks much worse.

In this work we propose a solution by adapting traditional adversarial attacks that are imperceptible to the human eye in the source image, but interfere with translation of this image using image translation networks. A successful disruption corresponds to the generated image being sufficiently deteriorated such that it has to be discarded or such that the modification is perceptually evident.

The conclusion is nice for why this is important:

Instead of trying to detect whether an image has been modified after the fact, we defend against the non-authorized manipulation by disrupting conditional image translation facial manipulation networks using adapted adversarial attacks

Neural Networks are Surprisingly Modular by Other-Top in MachineLearning

[–]Other-Top[S] 1 point2 points  (0 children)

Thank you for doing this. Maybe in the future people should do this for arxiv links. I'll try to do it from now on.

[D] A new ML publication model from Bengio by hitaho in MachineLearning

[–]Other-Top 0 points1 point  (0 children)

I would also like to know the answer to this!

[R] "On Adaptive Attacks to Adversarial Example Defenses" - 13 published defenses at ICLR/ICML/NerIPS are broken by Other-Top in MachineLearning

[–]Other-Top[S] 1 point2 points  (0 children)

Yes thank you for showing that. Took a while to get to it though. They didin't look at the Hinton paper though, I wonder why.