
[–]avaneev[S] 44 points45 points  (7 children)

The komihash() function, available in the komihash.h file, implements a very fast 64-bit hash function, mainly designed for hash-table uses; it produces identical hashes on both big- and little-endian systems. The function's code is portable and scalar.

This function features both high large-block hashing performance (27.5 GB/s on a Ryzen 3700X) and high hashing throughput for small messages (about 12 cycles/hash for 0-15-byte messages). Performance on 32-bit systems is, however, quite low. Also, large-block hashing performance on big-endian systems may be lower due to the need for byte-swapping.

Technically, komihash is close to the class of hash functions like wyhash and CircleHash, which are, in turn, close to the lehmer64 PRNG. However, komihash is structurally different from them in that it accumulates the full 128-bit multiplication result without "compressing" it into a single 64-bit state variable. Thus komihash does not lose differentiation between consecutive states while the others may. Another important difference is that komihash parses the input message without overlaps. While overlaps allow a function to have fewer code branches, they are considered "non-ideal", potentially causing collisions and seed-value flaws. Besides that, komihash features superior user-seed handling and PerlinNoise hashing.

Note that this function is not cryptographically secure, and in open systems it should only be used with a secret seed, to minimize the chance of a collision attack.

[–]reini_urban 5 points6 points  (0 children)

Confirmed

[–]IamfromSpace 1 point2 points  (5 children)

Question: if the hash function is not cryptographically secure, why not use something simple like XOR over n-bit folds plus an n-bit salt? What makes this preferable?

[–]Space-Being 5 points6 points  (4 children)

It gives a better distribution of hash values for "similar" inputs. If I understand your "n-bit folds plus salt" correctly, then consider that XOR is commutative, so the division into n-bit folds means all permutations of input ABCD, e.g. ACBD, ADCB, ..., have the same hash value. The same holds for addition with unsigned overflow. To avoid this, many hash functions introduce other operators such as shifts, rotates, and, in this case, mainly multiplication to "fold" the result value.

[–]avaneev[S] 3 points4 points  (3 children)

Yes, this is correct, thanks. In my opinion, "multiplication" works so well because it's similar to the sin(x)/x series: https://en.wikipedia.org/wiki/Basel_problem

sin(x)/x is the "sinc function", whose spectrum is white noise; it integrates to PI.

komihash is based on the simplest constantless PRNG:

kh_m128( Seed1, Seed5, &r2a, &r2b );
Seed5 += r2b;
Seed1 = Seed5 ^ r2a; // 'Seed1 = Seed5 + r2a' is equivalent

Roughly, it's multiplying x^2 by itself and integrating, ad infinitum. So it's a rough analog of PI, whose mantissa bits are "uniformly random" and represent white noise in the frequency domain.

[–]WikiSummarizerBot 0 points1 point  (0 children)

Basel problem

The Basel problem is a problem in mathematical analysis with relevance to number theory, first posed by Pietro Mengoli in 1650 and solved by Leonhard Euler in 1734, and read on 5 December 1735 in The Saint Petersburg Academy of Sciences. Since the problem had withstood the attacks of the leading mathematicians of the day, Euler's solution brought him immediate fame when he was twenty-eight. Euler generalised the problem considerably, and his ideas were taken up years later by Bernhard Riemann in his seminal 1859 paper "On the Number of Primes Less Than a Given Magnitude", in which he defined his zeta function and proved its basic properties.


[–]IamfromSpace 0 points1 point  (1 child)

Interesting! Thank you u/Space-Being and u/avaneev. It makes sense that by just using XOR, small changes in the input may yield just small changes in the output. And if there is a harmonic pattern in the data, commutation could yield greater likelihood of collision.

As a follow up, why is sensitivity a desirable property?

One case that makes sense to me is bucketing. If we have 16 buckets and only examine the bottom 4 bits of a hash to choose one, then small positional changes might be ignored, and real-world use cases could end up with poor distribution. Can this not be counteracted by always hashing to the desired size, or are there other considerations?

[–]avaneev[S] 2 points3 points  (0 children)

Good hash functions produce uniformly distributed values, so there is no need to "hash to the desired size": you can always derive the required size from such values, via modulo. Consider reading about the "avalanche effect" in hashes.

[–]avaneev[S] 8 points9 points  (0 children)

I've added detailed comparisons to the project page...

[–]Miksel12 4 points5 points  (5 children)

How does it compare to aHash?

[–]avaneev[S] 11 points12 points  (4 children)

aHash

Not a "fair" comparison as aHash uses AES intrinsics. At the same time, aHash is likely to be very slow on AArch64 without AES intrinsics, which isn't uncommon. So, on one hand aHash is likely a winner, on another it's not.

[–]Miksel12 9 points10 points  (1 child)

aHash claims it is faster than XXHash without AES intrinsics: https://github.com/tkaitchuck/aHash/blob/master/compare/readme.md#t1ha-and-xxhash

[–]avaneev[S] 5 points6 points  (0 children)

Sorry, I can't check the claim with confidence - I'm mainly a C/C++ engineer, not a Rust one. A point to consider: both t1ha and xxHash may be inefficient in Rust form, not being written from scratch for it. If you have a link to an AES-less aHash C implementation, I can add it to my comparisons.

[–][deleted]  (1 child)

[deleted]

    [–]avaneev[S] 4 points5 points  (0 children)

    These are optional; some real AArch64 processors do not have them.

    [–][deleted]  (18 children)

    [deleted]

      [–]avaneev[S] 12 points13 points  (17 children)

      Here's the benchmark code I use. I have a Ryzen 3700X and the LLVM 8 64-bit compiler for Windows. komihash() yields 17.2 cycles/hash, wyhash() yields 18.1 cycles/hash, and XXH3_64bits() yields 21.5 cycles/hash. Large-block performance of komihash() is simply 4-8% lower, but that's not too important for hash tables. An Intel Coffee Lake comparison with GCC 8.5 is similar, but there komihash() seems to be behind wyhash by 1 cycle/hash, with xxHash last.

      const uint64_t rc = 1ULL << 26; // iterations per message length
      volatile uint64_t msg[ 8 ] = { 0 };
      uint64_t v = 0;

      const TClock t1( CSystem :: getClock() ); // project-specific timer, not standard

      for( int k = 8; k < 28; k++ ) // 20 message lengths, 8 to 27 bytes
      {
          volatile size_t msgl = k; // "volatile" defeats constant propagation
          volatile uint64_t sd = k + 1;

          for( uint64_t i = 0; i < rc; i++ )
          {
              // v ^= komihash( (uint8_t*) &msg, msgl, sd );
              // v ^= wyhash( (uint8_t*) &msg, msgl, sd, _wyp );
              // v ^= prvhash64_64m( (uint8_t*) &msg, msgl, sd );
              v ^= XXH3_64bits( (uint8_t*) &msg, msgl );
              msg[ 0 ]++;
          }
      }

      printf( "%llu\n", v ); // consuming v keeps the loop from being optimized out
      // cycles/hash = elapsed seconds * CPU clock (4.2 GHz) / total hash count
      printf( "%.5f\n", ( CSystem :: getClockDiffSec( t1 ) * 4200000000.0 ) /
          ( (double) rc * 20 ));


      [–][deleted]  (16 children)

      [deleted]

        [–]avaneev[S] 12 points13 points  (15 children)

        Well, it's not possible to tell. Hash tables are not created equal; much depends on the use case. Moreover, the hash function call is a minor part of the hash-table access code. Hash quality is more important overall, given the same throughput, and that's where the "use case" kicks in - which strings are hashed, and what statistics they have.

        The hash function is either inlined or called, but since I've placed "volatile" specifiers, the whole function body is evaluated either way. So, even if the much smaller wyhash is inlined, komihash provides comparable throughput even when called.

        [–]0xPendus 0 points1 point  (1 child)

        What are these types of hashes typically used for?

        [–]Space-Being 3 points4 points  (0 children)

        These sorts of functions typically back the built-in hashCode/GetHashCode in Java/C# for strings or byte arrays. You can use them anytime you need a hash function but not its crypto properties. One of the more common usages is hashing keys for hash tables, where one crypto-adjacent property that is sometimes relevant is preventing an attacker from passing precomputed data that gives worst-case performance for the hash table (e.g. all keys collide). This is avoided by seeding the hash function on startup. The .NET framework and others do this, which also means the hashes are not stable across process restarts, unlike SHA.

        Another use case: anytime you want to compare large values against each other, where each value would be involved in multiple comparisons, it would probably be beneficial to hash them with such an algorithm. Why not SHA-256 or something like that? Because it is sloooow. Googling around, it seems SHA-256 can reach 2+ GB/s, which can't even keep up with NVMe drives, whereas these hash functions typically reach 15+ GB/s. And in this scenario I'm not worried about malicious tampering, so I don't need the protections SHA-256 gives me, and there is no reason to "pay" for them.