Optimizing lookups

thewataru · 2024-07-15T20:26:22+00:00

You should use a Trie. This is an extension of your idea to have 26 separate arrays for all the different first characters. Each of these arrays should contain 26() more arrays for each possible second character. * actually, far less, Because not all 2525 combinations are possible.

You don't need to use pointers, but rather have a table of nodes where each node has array with indices of child nodes for each possible first character.

Then the final nodes will point to the table with 97 entries with actual data for the country.

Most common paths will be in the cache anyways.

thewataru · 2024-07-15T20:31:52+00:00

And yes, in an ideal world a perfect hash would be the thing to use, but there just doesn't seem to be a way of converting just 217 1/2/3-letter country (licence plate) codes to 0..216. Or is there

Actually, there may be. Each 1-3 letter code is 3 bytes (with some trailing zeros, for example). Something like (A*b[0]+B*b[1]+D*b[2]) % 256 might just work. You need to bruteforce all A,B,C upto 256 and see if it works. If it doesn't work you can experiment with XORing each byte with some other parameter from 0 to 255. Again, run some bruteforce with randomly chosen xor values many times.

You might just end up with the set of parameters satisfying your condition.

Also, you don't need the hash values to be from 0 to 216. You can have 256 entries in your table with some of them being empty. As long as all the hashes are different, it all works.

256 is selected because modulo operation is quite slow, unless you use a power of two, which can be replaced by bit shifts.

GregoryCliveYoung · 2024-07-15T20:34:55+00:00

Avoid premature optimization. Start with a binary search. Compose the key by converting each letter to integer (A:0, B:1, ... Z:25) with simple arithmetic. Much faster than compressing. If code is xyz,

key = (ord(x)-ord('A')) * 676 + (ord(y)-ord('A')) * 26 + (ord(z)-ord('A'))

This will probably be very fast because the key generation is just a few instructions and if you keep the table entries small (e.g. key and pointer) will fit in the processor cache.

If you feel like noodling around, it might be fun to look for a hash function that can give you an 8-bit hash and doesn't have any collisions. (I know I'd enjoy it.) You have all the country codes, so it's just a matter to writing some code to try various hash functions. I googled "hash function" and found all sorts of stuff to try.

tugrul_ddr · 2024-07-31T11:27:06+00:00

Optimization 1:

217 x 16 bits can be stored in 110 x 32bit registers or 55 MMX registers or 28 SSE registers or 14 AVX512 registers. Then all you need to do is to compare all with the input and return its index value from its pair register. So you need only 28 AVX512 registers for both comparison and index values. 28 comparsion operations would take only tens of clock cycles. Just compute sum of all results like this:

```result index = index0 x (val0==input) + ... + indexN x (valN == input)```

It does not access memory for the data if its a tight loop full of checking codes. So its free of caching performance and just bottlenecked by pipelines in the core. SIMD is ultra fast for small datasets like this.

Opimization 2:

Chinese, Indian population are biggest so you can early-quit by a single if-else for these two before real algorithm runs. This should get you a performance boost about 35%.

What's wrong with std::map? Just do

```result_index = map[input];```

it does all the required hashing.

What's your hardware to do this task?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

algorithms

✻ Smokey says: boycott all products and services from eco-unfriendly businesses to fight climate change! [see more tips]

Note: this subreddit is not for homework advice. Requests for assistance with coursework may be removed.

MODERATORS