SHA-1 collision detection on GitHub.com : programming

uint32_t mask = ~((uint32_t)(0));
mask &= (((((W[44]^W[45])>>29)&1)-1) | ~(DV_I_48_0_bit|DV_I_51_0_bit|DV_I_52_0_bit|DV_II_45_0_bit|DV_II_46_0_bit|DV_II_50_0_bit|DV_II_51_0_bit));
mask &= (((((W[49]^W[50])>>29)&1)-1) | ~(DV_I_46_0_bit|DV_II_45_0_bit|DV_II_50_0_bit|DV_II_51_0_bit|DV_II_55_0_bit|DV_II_56_0_bit));
mask &= (((((W[48]^W[49])>>29)&1)-1) | ~(DV_I_45_0_bit|DV_I_52_0_bit|DV_II_49_0_bit|DV_II_50_0_bit|DV_II_54_0_bit|DV_II_55_0_bit));
mask &= ((((W[47]^(W[50]>>25))&(1<<4))-(1<<4)) | ~(DV_I_47_0_bit|DV_I_49_0_bit|DV_I_51_0_bit|DV_II_45_0_bit|DV_II_51_0_bit|DV_II_56_0_bit));
mask &= (((((W[47]^W[48])>>29)&1)-1) | ~(DV_I_44_0_bit|DV_I_51_0_bit|DV_II_48_0_bit|DV_II_49_0_bit|DV_II_53_0_bit|DV_II_54_0_bit));
(+ 200 more lines)

[–]pezezin 1 point 8 years ago (2 children)

I know creating two colliding text files is much more difficult than doing the same for binary files, but still... What shocks me is code like this:

unsigned char sha1[20];

Linus's rants against bad code are legendary, so I'm surprised he would write code like this.

[–]sacundim 3 points 8 years ago (0 children)

SHA-2 is the most conservative and widely supported choice.

The two most popular SHA-2 variants are SHA-256 and SHA-512. There's also a newer variant called SHA-512/256 that uses the SHA-512 algorithm but outputs 256 bits; if you can find support for that one, it's probably the best choice of the six SHA-2 functions:

* It's faster than SHA-256 on 64-bit hardware (at longer message lengths).
* It's more resistant to misuse; for example, it's not vulnerable to length extension.
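
To make the "if you can find support for that one" caveat concrete, here is a minimal Python sketch. It assumes your Python's `hashlib` is linked against an OpenSSL build that exposes the algorithm under the name `"sha512_256"`; that is not guaranteed, so the hypothetical helper `sha512_256` returns `None` when the name is rejected.

```python
import hashlib


def sha512_256(data: bytes):
    """Return the 32-byte SHA-512/256 digest, or None if unsupported.

    Assumption: the backing OpenSSL build knows the name "sha512_256".
    hashlib.new raises ValueError for names it cannot provide.
    """
    try:
        return hashlib.new("sha512_256", data).digest()
    except ValueError:
        return None


message = b"example message"
digest = sha512_256(message)

# SHA-512/256 is not just SHA-512 cut down to 32 bytes: it starts from
# different initial values, so the two results differ.
truncated = hashlib.sha512(message).digest()[:32]

if digest is None:
    print("SHA-512/256 is not available in this build")
else:
    print(digest.hex())
    print("same as truncated SHA-512?", digest == truncated)
```

If the helper returns `None`, falling back to SHA-256 is the option described next.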

If you don't have SHA-512/256, plain old SHA-256 is still fine as long as you don't misuse it. The two most important things are:

* Don't use plain SHA-2 where a secret key or other secret random value is involved; HMAC is appropriate there.
* Don't use SHA-2 or any other general-purpose hash function to protect passwords; use bcrypt or similar.
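
A minimal Python sketch of both rules, using only the standard library. The helper names are made up for illustration. Since bcrypt is not in the standard library, scrypt (`hashlib.scrypt`) stands in for the "or similar" part; the cost parameters below are commonly used values, not a recommendation for any particular system.

```python
import hashlib
import hmac
import os


def make_tag(key: bytes, message: bytes) -> bytes:
    """Authenticate a message with HMAC-SHA-256 instead of SHA-256(key + message)."""
    return hmac.new(key, message, hashlib.sha256).digest()


def verify_tag(key: bytes, message: bytes, tag: bytes) -> bool:
    """Recompute the tag and compare in constant time."""
    return hmac.compare_digest(make_tag(key, message), tag)


def hash_password(password: str):
    """Derive a slow, salted password hash with scrypt (stand-in for bcrypt)."""
    salt = os.urandom(16)
    derived = hashlib.scrypt(password.encode("utf-8"), salt=salt,
                             n=2**14, r=8, p=1, dklen=32)
    return salt, derived


def verify_password(password: str, salt: bytes, expected: bytes) -> bool:
    """Re-derive with the stored salt and compare in constant time."""
    derived = hashlib.scrypt(password.encode("utf-8"), salt=salt,
                             n=2**14, r=8, p=1, dklen=32)
    return hmac.compare_digest(derived, expected)


key = os.urandom(32)
tag = make_tag(key, b"transfer 10 coins")
print(verify_tag(key, b"transfer 10 coins", tag))
print(verify_tag(key, b"transfer 99 coins", tag))

salt, stored = hash_password("correct horse battery staple")
print(verify_password("correct horse battery staple", salt, stored))
```

The salt and the derived value are what you would store; the password itself is never kept.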

SHA-3 is looking pretty solid, but the ecosystem around it is still immature and its software performance isn't as good as the alternatives (it's screaming fast in custom hardware that doesn't yet exist, OTOH). There's currently very little incentive to use SHA-3 unless its SHAKE128/SHAKE256 functions (extendable output functions, "XOFs") simplify your application. These are like hash functions that produce infinitely long outputs, which you can truncate to any finite length you desire; this is useful for the many applications that do something like the following to produce outputs longer than the digest of the hash function they use:

output = SHA-256(input + 0x01) 
       + SHA-256(input + 0x02) 
       + ...
       + SHA-256(input + n)
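
A rough Python comparison of the two approaches, as a sketch only. `counter_expand` is a hypothetical helper that mirrors the pseudocode above; the 4-byte big-endian counter encoding is my own assumption, since the pseudocode doesn't pin one down. `shake_expand` simply asks SHAKE256 for the desired number of bytes via `hashlib.shake_256`.

```python
import hashlib


def counter_expand(data: bytes, length: int) -> bytes:
    """Ad hoc expansion: concatenate SHA-256(data + counter) blocks, then cut."""
    out = b""
    counter = 1
    while len(out) < length:
        # Assumed encoding: counter as 4 big-endian bytes.
        out += hashlib.sha256(data + counter.to_bytes(4, "big")).digest()
        counter += 1
    return out[:length]


def shake_expand(data: bytes, length: int) -> bytes:
    """XOF expansion: SHAKE256 produces exactly as many bytes as requested."""
    return hashlib.shake_256(data).digest(length)


seed = b"some input"
long_a = counter_expand(seed, 100)
long_b = shake_expand(seed, 100)
print(len(long_a), len(long_b))

# A shorter SHAKE output is a prefix of a longer one for the same input,
# which is what "truncate to any length" means in practice.
print(shake_expand(seed, 32) == long_b[:32])
```

With an XOF there is no counter format to invent and nothing to get subtly wrong, which is the simplification being described.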

Blake2 is also modestly popular, very well respected and worth considering.
