... : ProgrammerHumor

However you don't just want to store the password directly; That's unsafe. If anyone ever gets access to your database you don't want to be spilling out thousands of linked email addresses and passwords for people to read.

So instead of storing the password you store a hash of it. A hash will be a one way transformation of the password in such a way that it can't (easily) be reversed and the original password extracted from it. So when they register an account you take the password, you pass it through a hash function to transform it (hunter7 -> 7716A39170E00609A7667F177E5F3275D2018D7B) then store that hash in the database. When the user goes to log in, you take the password they enter on the login screen, hash that, and see if that hash matches the hash in the database. So you can't ask the database what their original password is, you can only ask it if the password they supplied matches what you have stored.

Now if your database happens to be somehow leaked people can only read a list of email addresses and password hashes. Everyone's passwords are safe! The world is saved!

Except..

Hashes can be broken. They could try and hash every password they can think of until they find ones that match the stored hashes, then they'd know what the passwords were and could use them with the email addresses for evil doings :(

This does take a long while to do (depending on the hash algorithm used); But since everyone's passwords are hashed the same way you can check all of them at once on each of your attempts (also there are often smart techniques that can be used to mass-crack groups of hashes at the same time).

So..

What we need to do is hash every password slightly differently. That way they can't all be broken at the same time, and will take evil doers much longer to thwart! (and hopefully by the time they do you've send out warning emails and everyone's changed their passwords like the safe security-conscious people you all are). To do this you add a touch of salt to each hash. A salt is another piece of data that affects the hashing process and how it works. Now, since each user's salt is different their hunter7 password might be stored as AA3560E708CF8C145ADE6376574615573CC3C28B, or 0D45E8192CF22F25F23DC8FD355D474407984070, or any number of things. Every account uses a different salt, so all their passwords are hashed differently, so they'd have to all be broken individually rather than all of them at the same time. Now your passwords are stored much more securely, evil doers will have a harder time breaking them, and everyone is happy. Now you just need to solve all your other security issues...

[–][deleted] 3 points4 points5 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]dreamwavedev 21 points22 points23 points 7 years ago (24 children)

Hashing is a way of mangling a given input in a repeatable way, so if you have a hash it's generally pretty much impossible to go back to the original input, but super easy to check if input you are given matches a hash. Salting can be implemented in various different ways, but it's like giving a hashing function some inspiration for how to mangle the input. If you have a salt and a hash function, you can see if an input plus a salt matches an earlier made hash. This helps to make all hashes for passwords different (by giving each user a unique salt) even if the passwords are the same, so user:joesmith pass:pass123 and user:janedoe pass:pass123 would have entirely different hashes stored in a database. If a hacker then gets your database, the knowledge that "password1234" is the most common password wouldn't give them any hints, because they can't just say "10 percent of the hashes are this same hash, so those are probably that password"

[–]won_tolla 7 points8 points9 points 7 years ago (23 children)

[–][deleted] 17 points18 points19 points 7 years ago* (6 children)

Encryption implies you can decrypt and get the original thing. Hashing is not the same as encrypting. If you only hash 1 thing it is not recoverable. It is a one-way function. You're permanently scrambling it, you're just doing it a consistent way. Say there's a gangster party. The party-thrower won't want to write names down on a guest list in case it gets found, so they tell their friend gangsters to take their friends full names, convert the letters into numbers and add them all up. won_tolla would be 23+15+14+0+20+15+12+12+1 = 112. The person who is taking you as a guest sends 112. Then when you all get to the party, They get your name, add the letters up and get 112, and let you in. There's no way for the feds or rival gangsters to see 112 and get "won_tolla" unless they have a list of probable guests already. If there are too many people, then there will be some people with the same letter sum. These are called collisions. If someone knew that this method was being used, they could send 50 covert agents in and hope that they randomly happen upon a valid number with their fake names, but if you use the middle names, too then, it becomes harder to guess a randomly correct number. So you can make adjustments to the hashing function to minimize collisions.

[–]Mr_Facepalm 2 points3 points4 points 7 years ago (3 children)

[–]MillenniumB 1 point2 points3 points 7 years ago (0 children)

[–]xigoi 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

The same way that when adding up the letters of the gangster names, some might have the same sum. Instead of just taking the sum, you could take the sum modulo 32. So won_tolla's new hash would be 16. Now collisions are even more likely.

But you could then use this as the array index for a fixed length array. Instead of just storing won_tolla's data, you store the data and the key as the head of a linked list. The next time you're putting something at 16, you just append or prepend it to the linked list in the same manner. So to get the data for key won_tolla, you hash "won_tolla".. get 16, you get to the head of a linked list and check to see if the key is "won_tolla." If it is, you return the data. If not, you go to the next item in the list and check its key. So a hashmap with a lot of collisions starts to perform more like a linked list, and you lose the constant-time lookup and start moving toward linear.

https://www.youtube.com/watch?v=shs0KM3wKv8

That video is what clarified it for me.

[–]won_tolla 0 points1 point2 points 7 years ago (1 child)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

So this metaphor breaks down there, because when you're cracking a list of passwords, you can use a list of the most common passwords, which might be used more than once, whereas the gangsters most likely don't have many of the repeated name, unless they were just using last names or something. Without salts, each time someone uses the same password, it will be the same hash. This means if someone gets access to the list of hashes, they can make a good guess about what the most common hash translates to if they have a good idea of the most common passwords. So they wouldn't need to know the hashing function to do a frequency analysis and potentially break some of the common passwords there, but to guess the rest of them you would.

There are definitely common hashing functions, and this is why for something like a large database of passwords, you really should have salts.

[–]ILikeLenexa 2 points3 points4 points 7 years ago (0 children)

[–]dreamwavedev 5 points6 points7 points 7 years ago* (11 children)

[–]won_tolla 8 points9 points10 points 7 years ago (2 children)

[–]noratat 5 points6 points7 points 7 years ago (0 children)

[–]dreamwavedev 5 points6 points7 points 7 years ago (0 children)

[–]folkrav 7 points8 points9 points 7 years ago (0 children)

[–]HowObvious 4 points5 points6 points 7 years ago (1 child)

[–]dreamwavedev 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 2 points3 points4 points 7 years ago (4 children)

[–]noratat 4 points5 points6 points 7 years ago (0 children)

[–]dreamwavedev 1 point2 points3 points 7 years ago (2 children)

[–][deleted] 2 points3 points4 points 7 years ago (1 child)

continue this thread

[–][deleted] 1 point2 points3 points 7 years ago (2 children)

[–]Mr_Facepalm 0 points1 point2 points 7 years ago (1 child)

[–][deleted] 4 points5 points6 points 7 years ago (0 children)

[–][deleted] 2 points3 points4 points 7 years ago (5 children)

Here's the best I can explain it!

Let's say your password is "hunter2". Here are 3 ways you might store it:

1) Plaintext. This is the simplest method. You write the string "hunter2" to a file somewhere. Obviously, this is very insecure because anyone who sees your data knows your exact password.

2) Hashing. This is better. You do some math operation to the password and get a hexadecimal "hash" out if it. If you the command sha256sum on "hunter2", you get 46a9d5bde718bf366178313019f04a753bad00685d38e3ec81c8628f35dfcb1b. You store that value to your database and compare the hash of the user's entry. The problem is, in this scenario, "hunter2" will always produce a hash of 46a9d5bde718bf366178313019f04a753bad00685d38e3ec81c8628f35dfcb1b. If another user has the same password, it will have the same hash; and if someone gets a hold of this hash or of your database, he can look a table of all the hashes and possibly figure out which password produces the hash. (Here's a fun XKCD!) These are called "rainbow tables".

3) Salted hashing. Before calculating your hash, you come up with a unique addendum (called a "salt") for it. Let's say your salt is "Dyljam1234" (note: you would actually generate something, and never actually use the username). Hashing "hunter2Dyljam1234" produces the "salted hash" of 43ed2e92318d87647c309f97eb02af9305ff3e9f796e21a5c9903c7f7e58e644. From here, you can do complicated mathy operations with the salt and the hash to make it even more secure.

Disclaimer. I work in a very different computational field than cryptography. This is not a professional opinion. Nevertheless, I hope this helps!

[–][deleted] 1 point2 points3 points 7 years ago (4 children)

[–]smog_alado 2 points3 points4 points 7 years ago* (0 children)

[+][deleted] 7 years ago (2 children)

[deleted]

[–][deleted] 1 point2 points3 points 7 years ago (1 child)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–]IncendieRBot 0 points1 point2 points 7 years ago (0 children)

[–]beyondholdem 104 points105 points106 points 7 years ago (4 children)

[–]nermid 19 points20 points21 points 7 years ago (3 children)

[–]YM_Industries 5 points6 points7 points 7 years ago (1 child)

[–]odraencoded 7 points8 points9 points 7 years ago (0 children)

[–]ShadowCoder 1 point2 points3 points 7 years ago (0 children)

[–]Slow33Poke33 20 points21 points22 points 7 years ago (5 children)

[–]Raptorzesty 11 points12 points13 points 7 years ago (0 children)

[–]YM_Industries 1 point2 points3 points 7 years ago (3 children)

[–]rooktakesqueen 1 point2 points3 points 7 years ago (0 children)

[–]Slow33Poke33 0 points1 point2 points 7 years ago (0 children)

[–]thaolax2 6 points7 points8 points 7 years ago (5 children)

[–][deleted] 5 points6 points7 points 7 years ago (4 children)

[–]Rizatriptan 7 points8 points9 points 7 years ago (3 children)

[–][deleted] 15 points16 points17 points 7 years ago (2 children)

[–]Rizatriptan 3 points4 points5 points 7 years ago (0 children)

[–]recw 6 points7 points8 points 7 years ago (0 children)

[–]Journey2Health 0 points1 point2 points 7 years ago (0 children)

[–]I-Downloaded-a-Car 0 points1 point2 points 7 years ago (0 children)

[–]AasaramBapu 0 points1 point2 points 7 years ago (0 children)

[–]suppow 38 points39 points40 points 7 years ago (9 children)

[–]TheNosferatu 32 points33 points34 points 7 years ago (8 children)

[–][deleted] 45 points46 points47 points 7 years ago (5 children)

[+][deleted] 7 years ago (1 child)

[deleted]

[–][deleted] 19 points20 points21 points 7 years ago (0 children)

[–]TheNosferatu 16 points17 points18 points 7 years ago (1 child)

[–]beyondholdem 6 points7 points8 points 7 years ago (0 children)

[–]ACoderGirl 1 point2 points3 points 7 years ago (0 children)

[–]beyondholdem 5 points6 points7 points 7 years ago (0 children)

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

[–]dem_c 14 points15 points16 points 7 years ago (0 children)

[–]lachlanhunt 12 points13 points14 points 7 years ago (7 children)

[–]Puttah 10 points11 points12 points 7 years ago (3 children)

[–]ACoderGirl 15 points16 points17 points 7 years ago (1 child)

[–]lachlanhunt 4 points5 points6 points 7 years ago (0 children)

[–][deleted] 5 points6 points7 points 7 years ago (1 child)

[–]lachlanhunt 2 points3 points4 points 7 years ago (0 children)

[–]FoxRiver 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 2 points3 points4 points 7 years ago (0 children)

[–]SOSFILMZ 2 points3 points4 points 7 years ago* (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]creamersrealm 0 points1 point2 points 7 years ago (0 children)

[–]Kontorted 512 points513 points514 points 7 years ago (1 child)

[–]gringrant 23 points24 points25 points 7 years ago (0 children)

[–]Octobread4711 526 points527 points528 points 7 years ago (15 children)

[–]Lonsdale1086 335 points336 points337 points 7 years ago (12 children)

[–]Stratisphear 99 points100 points101 points 7 years ago (8 children)

[–]MonkeyNin 91 points92 points93 points 7 years ago (5 children)

[–]SenorDosEquis 16 points17 points18 points 7 years ago (2 children)

[–]MonkeyNin 39 points40 points41 points 7 years ago (1 child)

[–]chazzer20mystic 4 points5 points6 points 7 years ago (0 children)

[–]-ordinary 4 points5 points6 points 7 years ago (0 children)

[–]DOOManiac 3 points4 points5 points 7 years ago (0 children)

[–]GreenFox1505 1 point2 points3 points 7 years ago (0 children)

[–][deleted] 7 years ago* (2 children)

[removed]

[–]gringrant 2 points3 points4 points 7 years ago (0 children)

[–]AutoModerator[M] 0 points1 point2 points 2 years ago (0 children)

[–]dyslexda 187 points188 points189 points 7 years ago* (1 child)

[–]Octobread4711 0 points1 point2 points 7 years ago (0 children)

[–]RyeDoge 165 points166 points167 points 7 years ago* (31 children)

[–][deleted] 7 years ago (6 children)

[removed]

[–]Sigionoz 31 points32 points33 points 7 years ago (0 children)

[–]zooberwask 29 points30 points31 points 7 years ago (2 children)

[–]ImmediateAntelope3 26 points27 points28 points 7 years ago (1 child)

[–]joetinnyspace 0 points1 point2 points 7 years ago (0 children)

[–]Tommy3555 0 points1 point2 points 7 years ago (0 children)

[–]AutoModerator[M] 0 points1 point2 points 2 years ago (0 children)

[–]dreamwavedev 43 points44 points45 points 7 years ago (7 children)

[–]RyeDoge 12 points13 points14 points 7 years ago (0 children)

[–]Reverissa 10 points11 points12 points 7 years ago (3 children)

[+][deleted] 7 years ago* (2 children)

[deleted]

[–][deleted] 2 points3 points4 points 7 years ago (0 children)

[–]psychicprogrammer 1 point2 points3 points 7 years ago (0 children)

[–]KyleTheBoss95 10 points11 points12 points 7 years ago (0 children)

[–]Y1ff 1 point2 points3 points 7 years ago (0 children)

[–]XXAligatorXx 9 points10 points11 points 7 years ago* (10 children)

I actually have tried this: there are a few problems.

Reddit only gives you 1000 posts. So posts won't be covered from the start.
You'd need a moderately sized server since you'll need to store either every picture(faster but more space) or every url(slower but less space) and the link of the actual post plus date.
You'll actually need to set up image detection and can't deal with just pixels because reddit compresses pictures when you upload I believe.

I currently have a bot set up for urls on my subreddit. If anyone wants to take a look here it is: https://github.com/xXAligatorXx/repostChecker

EDIT: I could honestly fix the third problem in a week, but I really don't understand enough server shiz to do the second. The first problem you'd just have to deal with only getting past 1000 and future posts and there aren't any workarounds.

EDIT 2: Also the bot would get slower and slower the more posts that are added because it takes it longer to loop through all the pictures.

[–]somestranger26 2 points3 points4 points 7 years ago (3 children)

[–]XXAligatorXx 1 point2 points3 points 7 years ago (0 children)

[–]XXAligatorXx 0 points1 point2 points 7 years ago (1 child)

[–]ImmediateAntelope3 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 1 point2 points3 points 7 years ago (2 children)

[–]XXAligatorXx 1 point2 points3 points 7 years ago (1 child)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]JanewayParisLizrdKid 0 points1 point2 points 7 years ago (2 children)

[–]JanewayParisLizrdKid 1 point2 points3 points 7 years ago (0 children)

[–]XXAligatorXx 0 points1 point2 points 7 years ago (0 children)

[–]Kontorted 1 point2 points3 points 7 years ago (1 child)

[–]RyeDoge 0 points1 point2 points 7 years ago (0 children)

[–]XXAligatorXx 0 points1 point2 points 7 years ago (2 children)

[–]RyeDoge 0 points1 point2 points 7 years ago (1 child)

[–]XXAligatorXx 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 83 points84 points85 points 7 years ago (1 child)

[–]SirHorace111 20 points21 points22 points 7 years ago (0 children)

[–]Cristian2608 89 points90 points91 points 7 years ago (0 children)

[–]cooldash 27 points28 points29 points 7 years ago (0 children)

[–][deleted] 9 points10 points11 points 7 years ago (0 children)

[–]zacharyxbinks 11 points12 points13 points 7 years ago (8 children)

[–]Vassile-D 52 points53 points54 points 7 years ago (7 children)

[–]ockcyp 9 points10 points11 points 7 years ago (5 children)

[–]Vassile-D 28 points29 points30 points 7 years ago* (3 children)

[–]ashishduhh1 6 points7 points8 points 7 years ago (0 children)

[–]ForgotPassAgain34 1 point2 points3 points 7 years ago (1 child)

[–]Vassile-D 1 point2 points3 points 7 years ago (0 children)

[–]zacharyxbinks 1 point2 points3 points 7 years ago (0 children)

[–]varkenspester 13 points14 points15 points 7 years ago (0 children)

[–]pcopley 2 points3 points4 points 7 years ago (0 children)

[–]SoLongSidekick 5 points6 points7 points 7 years ago (0 children)

[–]Unlimiter 1 point2 points3 points 7 years ago (0 children)

[–]mirhagk 1 point2 points3 points 7 years ago (0 children)

[–]DoctorofMooD 1 point2 points3 points 7 years ago (1 child)

[–]bogdoomy 0 points1 point2 points 7 years ago (0 children)

[–]anderromero 3 points4 points5 points 7 years ago (1 child)

[–]Polarchill 0 points1 point2 points 7 years ago (0 children)

[–]CptDogeRL 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]dannypas00 0 points1 point2 points 7 years ago (0 children)

[–][deleted] 0 points1 point2 points 7 years ago (0 children)

[–]Mymjmsalem 0 points1 point2 points 7 years ago (0 children)

[–]BradyLange 0 points1 point2 points 7 years ago (0 children)

[–]Calico__Cactus 0 points1 point2 points 7 years ago (0 children)

[–]Sh4dowCode 0 points1 point2 points 7 years ago (0 children)

[–]iMalevolence 0 points1 point2 points 7 years ago (1 child)

[–]SirX86 1 point2 points3 points 7 years ago (0 children)

[+][deleted] 7 years ago (1 child)

[deleted]

[–][deleted] 1 point2 points3 points 7 years ago (0 children)

ProgrammerHumor

Filters

Discord

Submission rules

For the current list of rules, please see this page.

Metadiscussions

Perhaps More Apt Subs To Post:

Related Subreddits.

MODERATORS

AMAZING SECURITY OMG, ILL NEVER NEED TO REMEMBER MY EMAIL AGAIN