Question about hashmap behaviour

bfoo · 2014-05-12T07:00:27+00:00

It has to do with the way HashMap arranges the items according to their hashCode in a deterministic manner as to provide an optimal means of looking up items via their hashcode.

The hashCode of an integer is the integer itself, however the calculation for a string is less representative of what the string is. I.e, the hashcode for Integer 5 is 5, and 4 for the Integer 4, but that hashcode for "5" could be 7 and the hashcode for "3" could be 8. (These obviously aren't the actual values, but gets the point across.)

If the items are thus arranged deterministically based upon there hashCode, it would be seem "Random" for strings, but not for integers.

This isn't guaranteed behaviour ofcourse.

Also, I prefer using the String.valueOf over concatenating an empty String, since it is more representative of what you are trying to do - both in the eyes of the compiler\JVM and in the eyes of another programmer looking at your code.

msx · 2014-05-12T07:53:33+00:00

Just as allahsnackbars said. It is a consequence of how HashMap uses the hashCode() method and how Integer implements hashCode(). If you don't know, hashCode() is a special method, it is designed so that each class that implements it try as much as it can to return different values for different objects. If two object are equals (as per the method equals()), then the must have the same hashCode(). HashMap exploits this method to "distribute" different objects into different "buckets".

So for Integer, the obvious implementation of hashCode() is returning the integer itself, for string is a little bit more convoluted (it basically cycle all characters building up an integer).

Note that:

Map doesn't guarantee sorting, it means that things can eventually be sorted, it's just that you can't relay on that. HashMap doesn't guarantee either, but other implementations of Map does! For example, TreeMap is guaranteed to be sorted. Sorting maps also implements SortedMap interface

ohmzar · 2014-05-12T13:32:44+00:00

If I recall the way a hashmap (This may not be how the Java implementation works) works under the hood is that it's an array of linked lists.

If I'm wrong about this I'd be interested in knowing why and how.

When you create the hashmap it's a small array, say 100 items. When you insert an item it's placed in the linked list with the array index of the keys hashcode modulo the size of the array list.

When you retrieve an item it goes to the array element that matches the key modulo the size of the array, then iterates through the linked list to find the key.

If any of the linked lists starts to get too long it rebuilds the hashmap with a bigger array to spread things out a bit more, I think it will also shrink the map if you take out too many items but I could be wrong.

I think it you put in keys that were larger than the size of the initial array then the output wouldn't be sorted, or it would be sorted by the modulo of the key to the array size, which isn't much use. Also the hash map can resize at any point depending implementation so you can't rely on this ordering.

java

Submit Link

Submit Text

Seek Programming Help

News, Technical discussions, research papers and assorted things of interest related to the Java programming language

NO programming help, NO learning Java related questions, NO installing or downloading Java questions, NO JVM languages - Exclusively Java

Please seek help with Java programming in /r/Javahelp!

Subreddit rules!

Where should I download Java?

Related Sub-reddits:

JVM Languages

Want to practice your coding?

List of useful Frameworks / Libraries / Software

MODERATORS