>But have you ever wondered if they made photos that look like you? Let's check!
>We collected the huge dataset of 428526 fake generated photos and extracted their facial parameters with https://github.com/ageitgey/face_recognition. Now you can match your image against fake faces and compare with the closest matches. Enjoy!
Maybe it is just me, but how would I go about it if I wanted to harvest large numbers of "real" photos of "real" persons?
Having curious people uploading them to my website could be an easy way.
https://thispersondoesnotexist.com/ only exists because someone already collected enough real photos of real persons to have a neural network learn their distribution well enough to generate new samples.
Harvesting even more photos is not really necessary at this point, and in any case, scraping them off the web would be faster than creating a novelty website.
>Harvesting even more photos is not really necessary at this point, and in any case, scraping them off the web would be faster than creating a novelty website.
But scraping them from the web says nothing about the source, even if you manage to remove all stock photos.
This way it is, IMHO, more likely that it is a "real" photo, most probably uploaded by a "real" user, and the site also has the IP of the sender.
Moreover, most photos you can find on the web have had their EXIF information removed by the host; that may not be the case for a photo uploaded by a casual user.
As I see it, scraping them off the web is good for quantity but not so much for quality; this (completely hypothetical) approach would give less quantity but, IMHO, better-quality data.
I definitely tried it with more junk images I had lying around than real photos, so there's that. There are moments when you need to know which face a computer would say most resembles a sushi... So I'm not 100% sure about quality.
>Maybe it is just me, but how would I go about it if I wanted to harvest large numbers of "real" photos of "real" persons?
Use APIs or scraping to collect profile photos from facebook, twitter, google, gravatar, etc. You'll get a lot of non-person photos, but havetheyfaked.me probably does too.
Yep, but as said above that would be "big data", with the need to de-duplicate it and with no additional (reliable) parameters.
Then you would need some AI (or whatever) to remove non-human or otherwise unsuitable photos (position, lighting, etc.), whilst this method would almost guarantee only "portrait" or "upper torso" pictures of humans.
What I tried to say wasn't that this is the "only" method, but that it is one of the "easy" ones with a high probability of getting "reliable" data.
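To make that filtering step concrete, here's a minimal sketch of keeping only single-face portraits from a scraped batch. The detector is injected so any real backend (e.g. face_recognition.face_locations) could be plugged in; the stub detector below is purely hypothetical.

```python
# Sketch of the filtering step described above: keep only images in which
# a face detector finds exactly one face. The detector is passed in so a
# real one (e.g. face_recognition.face_locations) can be substituted.

def filter_portraits(images, detect_faces):
    """Return only the images for which the detector finds exactly one face."""
    return [img for img in images if len(detect_faces(img)) == 1]

# Hypothetical stub detector: pretends any filename containing "face"
# holds one face, and everything else holds none.
def stub_detector(image_name):
    return ["bbox"] if "face" in image_name else []

scraped = ["face_001.jpg", "sushi.jpg", "face_002.jpg", "logo.png"]
portraits = filter_portraits(scraped, stub_detector)
print(portraits)  # ['face_001.jpg', 'face_002.jpg']
```

In practice the predicate would also check resolution, lighting, and face size, but the shape of the pipeline stays the same.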
There seem to be plenty of less-than-pleasant outputs from this net. I had a couple of strange ones when I landed: [1][2]. If those were generated with a GAN, the adversarial network could really use another training pass with examples like this (and if GANs are not used, a simple network could filter the first net's output).
I hate to be the one to break the news to you but ... you are a zombie (it's the only way they'd match your photo to pictures of computer-generated undead).
I love this comment mood. Everyone loses it when some random small website asks people to upload their photo if they want to find out some random information. But everyone's cool with sharing constantly on FB and IG, uploading their photos and locations to massive companies, etc. etc. The internet has distorted everyone's perspective on reality. Orwell would have something to say about it. Everyone loves Big Brother (FAANG etc.) but distrusts each other. Divide and conquer much?
It's almost as if big companies were regulated but personal websites were not.
> when some random small website
It could be some random website. It could also be a three-letter agency's website training airport-security face detection, a social-engineering website, or a random website storing images in a public S3 bucket.
> The internet has distorted everyone's perspective on reality.
You give all your personal / biometric info when you get a new passport. If some rando were asking for your biometric data on the street, would you give it to him?
>> It could also be a three-letter agency's website training airport-security face detection
Three-letter agencies will kindly ask/force Facebook to share its face-detection data and algorithms; no need to do this themselves.
In fact, I think the biggest co-conspirator of the NSA and other agencies is Apple. First they put a fingerprint sensor inside the Home button, so you can't avoid using it and have to share your fingerprints whether you like it or not. Now that they have captured enough fingerprints, they have moved to FaceID, whose sensors are so conveniently placed that you can't avoid having your face scanned, even if you never set up FaceID for your own use.
This is a good question. The thing is that, besides your photo, I have to extract its vector of facial parameters to match the fakes. So with this 128-dimensional vector I could find you in other photos (if I had some).
All the pics and their vectors stay on this small server.
If you want your data deleted, please drop me a line at wastemaster@gmail.com
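For anyone curious how the matching could work under the hood, here's a toy sketch: each photo is reduced to a 128-dimensional encoding (face_recognition's face_encodings produces such vectors), and the closest fakes are the ones at the smallest Euclidean distance, which is what face_recognition.face_distance computes. All the data below is synthetic; no real encodings or photos are involved.

```python
import math
import random

# Toy version of the matching step: rank stored "fake" encodings by
# Euclidean distance to a query encoding. Real encodings would come from
# face_recognition.face_encodings(); here they are random 128-d vectors.
random.seed(0)
fake_encodings = [[random.gauss(0, 1) for _ in range(128)] for _ in range(1000)]

# Pretend the uploaded photo's encoding is very close to fake #42.
your_encoding = [x + random.gauss(0, 0.01) for x in fake_encodings[42]]

def distance(a, b):
    # Euclidean distance, the same metric face_recognition.face_distance uses.
    return math.dist(a, b)

ranked = sorted(range(len(fake_encodings)),
                key=lambda i: distance(fake_encodings[i], your_encoding))
print(ranked[:5][0])  # 42, the planted nearest neighbour
```

With ~430k fakes a real service would likely use an approximate nearest-neighbour index rather than a full sort, but the distance computation is the same.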
> All the pics and their vectors stay on this small server.
This should be mentioned on the website.
> If you want your data deleted, please drop me a line at wastemaster@gmail.com
There should be an easier mechanism to request the deletion of our photos. Better still, request permission from the user to store the photo on your servers before actually storing it.
I think this is the bare minimum of transparency that should be required before letting people upload personal data, especially in this day and age.
Not to mention that the website is accessible from the EU, and you're required (by law) to obtain consent to store this personal data, and to tell people what exactly you're going to do with it and with whom you're sharing it (if anyone).
I know everyone's used to the wild west but I'm glad that's changing, because of comments like yours - this transparency should NOT be something done out of the website owner's good heart (because as we've seen, most will just give us the finger), but enforced by law.
Edit: For the record, wastemaster's actually quite nice, and this is not directed at them, just websites in general.
How exactly would anyone in the EU prosecute someone outside of it for running their own website, if that individual does so outside of the EU and has no affiliated organization or company? Just because something is accessible from the EU does not make it the EU's jurisdiction to police.
The EU claims jurisdiction based on the fact that part of the interaction occurred in the EU, so they can fine you (it should be noted that the GDPR applies to data related to people in the EU, not related to EU citizens living elsewhere). Whether they can collect on those fines is a different matter.
How do they intend to fine non EU residents hosting a website outside of the EU? I could see if it was a company but if someone is running a server with a not for profit site on it with no way to identify the site owner and an EU resident visits it, good luck trying to fine anyone. The EU does not own or even control the internet outside of their borders.
Why do you care what happens with a photo of your face? Many thousands of them exist; you probably have a profile photo on gravatar, or linkedin, or twitter somewhere anyway, to say nothing of the many thousands upon thousands of pictures of your face captured in frames on surveillance camera footage.
You provide this information (a picture of your face) to every convenience store, casino, bank, airport, and office building you walk into, many hundreds of times per day, for permanent storage. What is the threat model here from someone with a webserver having a single picture of your face with no other associated identifying information about it?
The fact that you don't want to immediately delete the data after processing is a cause for concern.
I can't think of any reason not to immediately delete the data, other than that you intend to use it for something else in the future.
That said, I appreciate your honesty. If you had actually nefarious intentions, you would presumably just claim that you deleted the data when you didn't.
> All the pics and their vectors stay on this small server.
Not to take a jab at you specifically, but the fact that someone who can make websites like these is ignorant enough about privacy (law) to casually drop this line marks a worrying development in the accessibility of AI tech.
For impactful technologies, we probably want the required domain knowledge to come with some structural social disciplining, so that we can collectively steer that impact in the right direction (whatever that is). Clearly, AI libraries have become so easy to use that professional ethics are not part of the curriculum.
>This is a good question. The thing is that, besides your photo, I have to extract its vector of facial parameters to match the fakes. So with this 128-dimensional vector I could find you in other photos (if I had some).
And, with all due respect, an alternative would be to provide an open-source program that computes this "128-dimensional vector" offline, so that only the vector is uploaded and NOT the photo.
Sending this vector is arguably even worse than sending just the photo. The vector alone would allow someone to match your face without ever having your original photo! (And an array of 128 floats requires less storage and offers less transparency.)
For example, if I extract a color map from your photo, is that still your personal data?
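To illustrate both sides of this exchange (how small the client-side payload would be, and that it is still matchable data), a hypothetical client could serialize the 128 floats like this. The all-zero encoding is a dummy placeholder, not a real face encoding.

```python
import json

# Sketch of the client-side alternative proposed above: compute the
# 128-dimensional encoding locally (face_recognition.face_encodings would
# do this offline) and upload only the vector, never the photo.
encoding = [0.0] * 128  # dummy placeholder for a real face encoding

payload = json.dumps(encoding)

# As the reply points out, this is hardly more private: the payload is
# tiny (well under a kilobyte as JSON, 512 bytes as raw float32) yet
# still sufficient to match the same face on other photos.
print(len(payload) < 1024)  # True
```

That is the crux of the privacy argument: the vector is a compact biometric identifier, so shipping it instead of the photo changes the bandwidth, not the risk.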
Everyone's getting absolutely bloody terrified about uploading their photos... What do you think the creator's gonna do, drive to your house and kill you, knowing nothing but what you look like?
None of the matches looked a ton like me, imo, but I tried five photos of myself from 2011-2018 and a couple of matches kept recurring -- not as the top match, but in the top few, so that was interesting.
I didn't trim my beard or mustache from 2009 - 2012, when I shaved it off entirely, and I didn't cut my hair from 1996 - 2017, so there's some good variation in these pictures.
Haha, well, I was looking at them too. I'm not a native speaker, so I considered at least using their naming scheme: haveibeenfaked.co or something. Fortunately I chose the current option.
Got two GDPR complaints, so I added automated removal of uploaded photos and the metadata extracted from them.
Now photos and data are only stored for 3 minutes, then deleted (the app architecture requires server-side processing of the uploaded file, and to compare the results with your photo I have to store it briefly, to be able to show it).
I have also reflected this information on the website itself.
Thank you for your interest in this project!
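A 3-minute retention policy like the one described could be implemented as a periodic cleanup job along these lines. The directory layout and schedule here are assumptions for illustration, not the site's actual code.

```python
import time
from pathlib import Path

# Sketch of a retention job: delete any uploaded file older than the
# cutoff (180 seconds matches the 3-minute policy described above).
# This would be run periodically, e.g. from cron or a background thread.
def purge_old_uploads(upload_dir, max_age_seconds=180):
    """Delete files in upload_dir older than max_age_seconds; return their names."""
    cutoff = time.time() - max_age_seconds
    removed = []
    for path in Path(upload_dir).iterdir():
        if path.is_file() and path.stat().st_mtime < cutoff:
            path.unlink()
            removed.append(path.name)
    return removed
```

The same job would also need to drop the extracted vectors from whatever store holds them, since (as discussed above) the vector is as sensitive as the photo.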
Well done. Does this apply to all images that have ever been uploaded to the website, or only to the ones uploaded after your update? If a user uploaded an image 2 hours ago, will their image be stored or has it been deleted already?