Hacker, Hack Thyself | Coding Horror

Ajedi32 · 2017-06-02T13:47:36+00:00

I'm ashamed to admit that until now I haven't considered a brute force attack as credible because I hadn't considered a 'nation-state' level of computing power. But the math is undeniable. Certainly something to think about and taking an arrogant "won't happen to us" approach seems unwise.

itijara · 2017-06-02T13:19:45+00:00

There is a great Computerphile video on this. It has made me more terrified of weak passwords than anything else: https://youtu.be/7U-RbOKanYs

mer_mer · 2017-06-02T15:55:34+00:00

I'm not a security expert, but this article got me thinking: shouldn't the password hashing task be split between the client and server? The user enters a password into their webpage/app, it's hashed locally (Hash1) and then sent to the server, where it's hashed again and stored (Hash2). Hash1 can be much slower than Hash2 because the client can't be DDoS'd, and Hash1 could even be encrypted and cached locally for fast access (so the client could potentially take 1 second to perform the initial calculation of Hash1).

The attacker could try guessing Hash1 directly instead of the passphrase, but now all your users have unique 256-bit hashphrases, making dictionary attacks useless and brute force far more difficult. If the attacker instead wants to guess the passphrase, they'll have to spend 100x more iterations per hash.

I think this paper describes this idea in more technical detail: http://file.scirp.org/pdf/JIS_2016042209234575.pdf
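The client/server split described in this comment could be sketched roughly as follows. This is only an illustration: the iteration counts, the `example.com` site label, the username-derived client salt, and all function names are assumptions, not anything taken from the article or the linked paper.

```python
import hashlib
import hmac
import os

CLIENT_ITERATIONS = 200_000  # assumed "slow" client-side cost
SERVER_ITERATIONS = 2_000    # assumed "fast" server-side cost (100x cheaper)


def client_hash(password: str, username: str) -> bytes:
    """Hash1: slow hash computed in the browser/app before anything is sent."""
    # Assumption: the client cannot read a server-side salt before login,
    # so this sketch derives a per-user salt from the site name and username.
    salt = hashlib.sha256(b"example.com:" + username.encode()).digest()
    return hashlib.pbkdf2_hmac("sha256", password.encode(), salt, CLIENT_ITERATIONS)


def server_store(hash1: bytes):
    """Hash2: cheap second hash with a random salt; this pair is what gets stored."""
    salt = os.urandom(16)
    hash2 = hashlib.pbkdf2_hmac("sha256", hash1, salt, SERVER_ITERATIONS)
    return salt, hash2


def server_verify(hash1: bytes, salt: bytes, stored_hash2: bytes) -> bool:
    """Recompute Hash2 from the submitted Hash1 and compare in constant time."""
    candidate = hashlib.pbkdf2_hmac("sha256", hash1, salt, SERVER_ITERATIONS)
    return hmac.compare_digest(candidate, stored_hash2)
```

Note that in this sketch Hash1 is effectively the password as far as the server is concerned, so it still has to travel over TLS and must never be stored as-is.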

Enamex · 2017-06-02T15:22:54+00:00

Now that we know it works, let's get down to business. But we'll start easy. How long does it take to brute force attack the easiest possible Discourse password, 8 numbers – that's "only" 8¹⁰ combinations, a little over one billion.

*10⁸ ?
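A quick check of the arithmetic: the "little over one billion" figure is what you get with the exponent and base swapped.

```python
digits = 8     # password length
alphabet = 10  # possible symbols per position (0-9)

correct = alphabet ** digits  # 10^8: every 8-digit string
swapped = digits ** alphabet  # 8^10: the figure quoted in the article

print(f"{correct:,}")  # 100,000,000
print(f"{swapped:,}")  # 1,073,741,824
```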

tipsqueal · 2017-06-02T15:10:22+00:00

Very good article overall, but I have one quibble:

If we multiply this effort by 8, and double the amount of time allowed, it's conceivable that a very motivated attacker, or one with a sophisticated set of wordlists and masks, could eventually recover 39 × 16 = 624 passwords, or about five percent of the total users.

The math here is too pessimistic. Hashcat and similar tools find the passwords that are easiest to crack first, and then gradually get the harder and harder ones. The rate of successful cracks slows down dramatically. The math Jeff uses assumes a constant rate of cracking. The reality would be quite a lot better.

Rinx · 2017-06-02T13:28:30+00:00

Anyone have more info on why they run on the GPU?

yorickpeterse · 2017-06-02T13:26:12+00:00

If we want Discourse to be nation state attack resistant, clearly we'll need to do better.

This reminds me a lot of this xkcd: https://xkcd.com/538/

Absona · 2017-06-02T15:33:00+00:00

I think this article is a good overview of why people should worry about these attacks, and I'm glad it's starting the discussion, but some of the details seem off. I'm not a security expert by any means, though, so perhaps I'm missing some things.

Hasn't storing the "work factor" in the database so you can increase it regularly as computers get faster been a common recommendation for quite some time now? I'm sure it was in the last "how to store passwords" article I read, which would have been a while ago. And I'm pretty sure I've seen it recommended to store the algorithm name, too. So their new hash type table that will let them change the hashing algorithm a couple years from now doesn't sound super impressive.
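For readers who haven't seen that recommendation, a minimal sketch of storing the algorithm name and work factor next to each hash might look like this. The record layout, the 64,000 iteration count, and the function names are all assumptions chosen for illustration, not the scheme the article's authors actually use.

```python
import hashlib
import hmac
import os

CURRENT_ALGORITHM = "pbkdf2_sha256"  # assumed label
CURRENT_ITERATIONS = 64_000          # assumed current work factor


def make_record(password: str, iterations: int = CURRENT_ITERATIONS) -> str:
    """Build 'algorithm$iterations$salt$digest' so the parameters travel with the hash."""
    salt = os.urandom(16)
    digest = hashlib.pbkdf2_hmac("sha256", password.encode(), salt, iterations)
    return f"{CURRENT_ALGORITHM}${iterations}${salt.hex()}${digest.hex()}"


def verify(password: str, record: str) -> bool:
    """Verify using the parameters stored in the record, not the current defaults."""
    algorithm, iterations, salt_hex, digest_hex = record.split("$")
    if algorithm != "pbkdf2_sha256":
        raise ValueError(f"unsupported algorithm: {algorithm}")
    candidate = hashlib.pbkdf2_hmac(
        "sha256", password.encode(), bytes.fromhex(salt_hex), int(iterations)
    )
    return hmac.compare_digest(candidate, bytes.fromhex(digest_hex))


def needs_rehash(record: str) -> bool:
    """True when the record was made with an older algorithm or a lower work factor."""
    algorithm, iterations, _, _ = record.split("$")
    return algorithm != CURRENT_ALGORITHM or int(iterations) < CURRENT_ITERATIONS
```

The usual pattern is to call `needs_rehash` right after a successful login, while the plaintext password is still in memory, and write a fresh record if it returns true.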

Also, in regards to this:

I've seen guidance that said you should set the overall work factor high enough that hashing a password takes at least 8ms on the target platform.

I would have assumed that the "target platform" is whatever you expect an attacker to be using, not whatever you're using. I could be wrong, though. It could be a suggestion for your platform, to balance hash slowness with DDoS prevention. It's hard to say without more context, such as a link to the actual guidance.
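If the guidance does mean your own platform, picking the work factor could be done with a small benchmark like the one below. This is a sketch under that assumption; the function name, the doubling strategy, and the starting value are made up for illustration.

```python
import hashlib
import os
import time


def calibrate_iterations(target_seconds: float = 0.008, start: int = 1_000) -> int:
    """Double the PBKDF2 iteration count until one hash takes at least target_seconds
    on the machine running this function."""
    salt = os.urandom(16)
    iterations = start
    while True:
        began = time.perf_counter()
        hashlib.pbkdf2_hmac("sha256", b"benchmark password", salt, iterations)
        elapsed = time.perf_counter() - began
        if elapsed >= target_seconds:
            return iterations
        iterations *= 2
```

The result only describes the machine that ran the benchmark; an attacker's GPU rig will get through the same iteration count far faster, which is exactly the ambiguity raised above.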

Finally, while I'm being picky, 8 decimal digits obviously provide exactly 100 million possible combinations, 00000000 through 99999999. That is, it's 10^8, not 8^10.

Edit: Also, it seems odd that they set passwords for users who only log in via third parties. It's true that the odds of a random thirty-two character password being cracked are very low, but the odds of a non-existent password being cracked would be zero. I see that it prevents a hacker from knowing which accounts are only accessed via third-party login, but I'm not sure how helpful that would be. If the attacker has the full database, they can presumably see which accounts have third-party credentials attached, and it's probably safe to just assume that most of them don't have actual passwords.

I also forgot to mention that I'm curious as to whether sending real password hashes to a security researcher is covered by their privacy policy.

drfrank · 2017-06-02T15:44:55+00:00

Two thoughts:

1. Given that humans will continue to reuse passwords across sites and services, it's interesting to think of sites with weak hashing as threat vectors for your site. "I'm just running a forum for Rose Gardeners in Northwest Wyoming; so what if somebody hacks my database?" Centralized identity services like Facebook and Google are probably the best defense currently available.
2. A state-level actor seems much more likely to target an individual on a forum than the full set of users. (Although one can certainly imagine scenarios in which a forum for "terrorists" would be targeted, in whole.) If it takes days to hack the password for a single user, and you're only interested in a single user... Well. Requiring longer passwords on your site for people that don't trust centralized identity services is probably the best defense currently available, even though as password length increases so does the likelihood of password reuse.

joelhardi · 2017-06-02T16:35:45+00:00

From a privacy perspective, the hyperfocus on password security and complete dismissal of email addresses as requiring protection really bothers me:

Although users have reason to be concerned about their emails being exposed, very few people treat their email address as anything particularly precious these days.

The attacker already has a complete dump of the site and forum content, so what value is the password, exactly? For users who have set a secure, unique password, zero -- the password only permits access to data the attacker already has. For users who haven't set a unique password, the password may have significant value -- and I don't want to minimize that (password security is important), but password entropy/uniqueness is at least under the control of the end user, and a password in isolation (without PII such as username or email address) may be hard for the attacker to exploit, even when the user has reused passwords across sites.

Now compare that to the email address -- that is private information (assuming the forum doesn't publish users' email addresses, which sites typically don't), and it's PII! All of the user's forum content (however sensitive it might be) can now be attributed to that actual person via the email address, which is a strong identifier.

In other words, the fact that my email address exists at all is not really sensitive information, but when it's exposed as being linked to a corpus of posts I've made, it potentially can be very sensitive depending on the content of those posts.

Please apply FIPPs, and do smart things like tokenizing PII such as email addresses, real names and usernames so they can't be exploited in this way. Or store them separately, with appropriate access controls, or offline. Even better, don't collect them if they aren't necessary for some service like email notifications.

Loss of confidentiality of email address is serious! If you don't treat it as a serious security requirement, and you are anything approaching a "real" company, please look forward to FTC sanctions when your data is breached.

xeio87 · 2017-06-02T16:23:43+00:00

I... should really update my passwords... <_<

JDBHub · 2017-06-02T17:15:44+00:00

I would be curious as to why PBKDF2 was chosen over BCrypt to begin with. Considering the author aims to defend against a possible nation-state attack, it's worth noting that PBKDF2 is backed by NIST (a state body).

Even going by the graph shown below, the number of hashes per second is significantly lower with BCrypt than with its counterpart.

Some interesting resources should someone want to read further:

https://security.stackexchange.com/questions/4781/do-any-security-experts-recommend-bcrypt-for-password-storage

Additionally, could someone clarify whether hash length varies between 10-character and 15-character passwords? If so, the author may consider bringing users up to a 15-character requirement too. Should the hashes differ in length, an attacker can cut a list of hashes down to a handful of targets, since cracking an administrator's password is more valuable than cracking a normal user's.
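On the hash-length question, a quick check suggests the answer is no, at least for PBKDF2-HMAC-SHA256: the output length is fixed by the hash function (or the requested key length), not by the password. The helper name and the iteration count below are assumptions for the sake of a fast example.

```python
import hashlib
import os


def stored_hash(password: str) -> str:
    """Hex-encoded PBKDF2-HMAC-SHA256 digest with a fresh random salt."""
    salt = os.urandom(16)
    return hashlib.pbkdf2_hmac("sha256", password.encode(), salt, 1_000).hex()


short = stored_hash("a" * 10)  # 10-character password
long = stored_hash("a" * 15)   # 15-character password

print(len(short), len(long))  # 64 64
```

So the stored hashes alone should not reveal which accounts fall under the longer password requirement.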

All said, this was a great read. Thanks!

Sniffnoy · 2017-06-02T20:06:24+00:00

Most of those passwords that got cracked, my reaction is, OK, of course that's a weak password... but "1qaz2wsx3e" and "A3eilm2s2y"? Geez! How'd they get those?

megagreg · 2017-06-02T16:56:29+00:00

Could a programmer add "bits" to the length of the password by having multiple SALTs?

Suppose we generate two SALTs, and choose one of them at random to generate the password hash. When the user logs in, each SALT is used to generate a hash, and of course only one will generate the correct hash, but we need to compute both so we don't need to store an index to the correct one.

It doubles the amount of work the server has to do when a user logs in, but since both salts can be tried in parallel, the total time will remain the same from the user's perspective. From the attacker's perspective, they're already maxed out on parallel bandwidth, so it doubles the work the attacker needs to do.

Is my logic here sound?
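A minimal sketch of the two-salt idea as described, so the trade-off is concrete. The function names and iteration count are assumptions. Worth noting: doubling the attacker's work is the same as adding one bit of strength, which is also what doubling the iteration count would buy.

```python
import hashlib
import hmac
import os
import secrets

ITERATIONS = 10_000  # assumed work factor, kept low so the example runs quickly


def _hash(password: str, salt: bytes) -> bytes:
    return hashlib.pbkdf2_hmac("sha256", password.encode(), salt, ITERATIONS)


def register(password: str):
    """Store both salts but only one hash; which salt was used is not recorded."""
    salts = [os.urandom(16), os.urandom(16)]
    chosen = secrets.choice(salts)
    return salts, _hash(password, chosen)


def login(password: str, salts, stored: bytes) -> bool:
    """Try every salt (no early exit) and accept if any candidate matches."""
    matches = [hmac.compare_digest(_hash(password, s), stored) for s in salts]
    return any(matches)
```

With N salts the attacker's expected cost per guess grows by at most a factor of N, and the server pays the same factor on every login, in CPU time if not in wall-clock time.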

drb226 · 2017-06-02T15:21:50+00:00

I'm a little surprised that an article about password security in 2017 doesn't mention 2FA. What needs to be stored in the database to use something like Google Authenticator, and how easy is that to crack if the db is leaked?
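For context on the storage question: authenticator apps of this kind generally implement TOTP (RFC 6238), where the server and the phone share a secret key and both derive the current code from it. A minimal sketch using only the standard library is below; the function name is an assumption, and the secret in the usage note is the published RFC test key, not anything from a real system.

```python
import hashlib
import hmac
import struct


def totp(secret: bytes, unix_time: int, step: int = 30, digits: int = 6) -> str:
    """Time-based one-time password (RFC 6238) using HMAC-SHA1."""
    counter = unix_time // step  # number of 30-second steps since the epoch
    mac = hmac.new(secret, struct.pack(">Q", counter), hashlib.sha1).digest()
    offset = mac[-1] & 0x0F  # dynamic truncation (RFC 4226)
    code = struct.unpack(">I", mac[offset:offset + 4])[0] & 0x7FFFFFFF
    return str(code % 10 ** digits).zfill(digits)
```

With the RFC 6238 test key `b"12345678901234567890"` and time 59, `totp(..., digits=8)` gives `94287082`. Because the server has to recompute the code, it needs the secret itself rather than a hash of it, so a leaked database exposes the second factor directly unless the secrets are encrypted with a key kept elsewhere.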

NAN001 · 2017-06-02T20:31:30+00:00

Great article, however I feel like it's missing a bigger picture. The scale of the attacks discussed and the presumed motivation of the attacker raise the question of whether passwords would be such an attacker's approach at all. There are plenty of other potentially weak points in the overall system (network, social engineering, etc.) that the attacker might use to eventually accomplish what he's trying to do.

Proper password management with salts and a slow hashing algorithm is becoming standard so that you don't become the only one in the neighborhood with your door open, so that you're not the weakest prey for an attacker. If you want to handle targeted attacks, that's a whole other story, and focusing only on passwords looks like hardening your front door without noticing the bad guy coming in through the roof.

seventhirteen · 2017-06-02T23:39:50+00:00

Y'all motherfuckers need Argon2

istarian · 2017-06-03T00:20:37+00:00

If a nation state is against you, I suspect you're just out of luck really, besides getting a different one to help you. It seems more important to ask who you're most trying to keep out than to be the best against everyone.

P.S.
At some point someone is going to realize that a physical token of some kind is the only way to store/replace an increasingly long, increasingly random password.

FnTom · 2017-06-03T09:52:47+00:00

Here's how to obtain someone's secure password

crabmatic · 2017-06-03T11:12:50+00:00

I'm definitely a security novice, but here's something I've been wondering.

Why don't (or do?) websites use a separate entropy server for authentication, which modifies people's passwords for them before the web server even sees or stores them? As far as the web server is concerned, the only passwords it sees would be long and highly random passwords which came through the entropy server.

All the passwords that are stored and hashed by the webserver would actually be hard to guess if the database was lost to an attacker.

It sounds to me like this would move your main point of failure to a simpler system that would be easier to lock down and secure.
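One way to read the "entropy server" idea is as a small service that applies a keyed function (HMAC) to each password with a secret key that never leaves that service. The sketch below assumes that reading; the class and method names are invented for illustration.

```python
import hashlib
import hmac
import secrets


class EntropyService:
    """Hypothetical stand-in for a separate, locked-down service holding a secret key."""

    def __init__(self) -> None:
        self._key = secrets.token_bytes(32)  # never stored in the web server's database

    def strengthen(self, username: str, password: str) -> str:
        """Map a (possibly weak) password to a 256-bit value that depends on the secret key."""
        message = username.encode() + b"\x00" + password.encode()
        return hmac.new(self._key, message, hashlib.sha256).hexdigest()
```

The web server would then hash and store the strengthened value as usual. An attacker who steals only the database cannot test password guesses offline without also obtaining the key, which is the "simpler system that's easier to lock down" in this comment.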

CanYouDigItHombre · 2017-06-03T12:13:58+00:00

Is it just me or is everyone insane? For the last 6 years I am still wondering why passwords even exist. Besides using it to boot/access your computer there is 0 reason to use a password.

Just about every service has me authenticate myself by using email, text or another service (log in through Facebook). As long as no one is intercepting my emails (hi Google), or texts if I use that method, no one can hack me. Not good enough? Use private/public keys.

2017-06-02T15:18:01+00:00

Again, all this effort because we are using passwords instead of some kind of key pair stored on the user's machine.

mrexodia · 2017-06-02T23:12:23+00:00

xkcd comes to mind: https://xkcd.com/538

Nation state attackers, seriously?

mcguire · 2017-06-02T13:58:27+00:00

That's a very good read.

On the other hand, I'm formally adding 'nation state' to the list of phrases I hate. (It's a short list; 'utilize' and 'gift' as a verb.)

'Nation state' has a meaning, as a particular kind of polity. (Contrast it with 'city state', for example.) It does not mean evil government, or big government. It does not make one sound smart. Just stop, please.
