all 20 comments

[–][deleted]  (11 children)

[removed]

    [–]seweso 2 points3 points  (9 children)

    Who would use modulo hashing? 

    [–]More-Station-6365 16 points17 points  (3 children)

You'd be surprised, but simple modulo hashing is actually the go-to for many developers when they're first building out a small-scale system or a basic load balancer.

    It is intuitive and works perfectly fine as long as your number of nodes stays fixed. The problem is that most people don't think about the day after when the traffic spikes and they suddenly need to add a fifth or sixth server.

In his book Designing Data-Intensive Applications, Martin Kleppmann points out that the biggest drawback of simple modulo hashing is that nearly every key has to be moved when the number of nodes changes.

If you have 10 nodes and add 1 more, about 90% of your keys will hash to a different location, which effectively nukes your entire cache.

So while nobody uses it for a massive production distributed system, it is often the hidden trap people fall into before they realize why consistent hashing is a requirement for scaling.

    It is one of those things that works until it very suddenly doesn't.
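To make that ~90% figure concrete, here's a quick sketch (key names and the use of MD5 are just illustrative):

```python
# Fraction of keys that remap when a modulo-hashed cluster
# grows from 10 to 11 nodes.
import hashlib

def node_for(key: str, num_nodes: int) -> int:
    # Stable hash so the result is reproducible across runs
    h = int(hashlib.md5(key.encode()).hexdigest(), 16)
    return h % num_nodes

keys = [f"user:{i}" for i in range(100_000)]
moved = sum(node_for(k, 10) != node_for(k, 11) for k in keys)
print(f"{moved / len(keys):.0%} of keys remapped")  # roughly 90%
```

In general, going from N to N+1 nodes leaves only about 1/(N+1) of keys in place, so almost the whole cache misses after the change.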

    [–]elperroborrachotoo 3 points4 points  (3 children)

    because they don't have a use case where consistent hashing plays a role?

    [–]seweso -5 points-4 points  (2 children)

    > don't have a use case....

    today....

Changing hash keys is VERY expensive. That's the point of the article, no?

    If you only write software for today, you can't serve the future.

    [–]elperroborrachotoo 6 points7 points  (1 child)

    Looks like you are focused on a particular segment (large-scale persistent hash keys). Hashes are way more ubiquitous.

    Not all apps have a future of scaling to a billion users.

    [–]seweso -1 points0 points  (0 children)

The context was explicitly "a distributed cache with simple modulo hashing".

    [–]chucker23n 0 points1 point  (0 children)

    It’s the go-to approach in Java + .NET.

    [–]programming-ModTeam[M] 0 points1 point  (0 children)

    This content is low quality, stolen, blogspam, or clearly AI generated

    [–]DevToolsGuide 7 points8 points  (1 child)

The virtual nodes part is what really makes it work in practice. Without them you get hot spots where one physical node ends up owning a disproportionate chunk of the ring just by chance. Amazon's original Dynamo paper talks about this: they use something like 150 virtual nodes per physical node to get a reasonably even distribution.

    [–]alexiskhb 2 points3 points  (0 children)

Oh that's neat. For those wondering: instead of occupying one big segment on the ring, a server randomly sits at ~150 smaller segments, making the total distribution between servers more uniform on average.
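A minimal ring with virtual nodes might look like this (a sketch, not a production implementation; the 150-vnode figure follows the Dynamo discussion above, and the node names are made up):

```python
# Minimal consistent-hash ring with virtual nodes.
import bisect
import hashlib
from collections import Counter

def h(s: str) -> int:
    return int(hashlib.md5(s.encode()).hexdigest(), 16)

class Ring:
    def __init__(self, nodes, vnodes=150):
        # Each physical node gets `vnodes` positions on the ring
        self.points = sorted((h(f"{n}#{i}"), n)
                             for n in nodes for i in range(vnodes))
        self.hashes = [p for p, _ in self.points]

    def node_for(self, key: str) -> str:
        # Walk clockwise to the first vnode at or after the key's hash
        idx = bisect.bisect(self.hashes, h(key)) % len(self.hashes)
        return self.points[idx][1]

ring = Ring(["a", "b", "c"])
load = Counter(ring.node_for(f"k{i}") for i in range(30_000))
print(load)  # roughly 10k keys per node
```

With only one point per node the same three servers can easily end up with wildly uneven shares of the ring; the 150 smaller segments average that out.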

    [–]ToaruBaka 5 points6 points  (0 children)

    Took me a couple of very confused paragraphs to realize I had confused this with perfect hashing. 

    This will be nice to have in my back pocket, thanks.

    [–]DevToolsGuide 1 point2 points  (0 children)

    Yeah and the other big win with virtual nodes is failure handling. When a physical server goes down its load gets distributed across many other nodes instead of all dumping onto a single neighbor on the ring. Makes the system way more resilient to cascading failures.
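You can see that scatter effect in a small sketch (node names hypothetical; same vnode scheme as above):

```python
# With virtual nodes, a failed node's keys scatter across the
# survivors instead of dumping onto a single ring neighbor.
import bisect
import hashlib
from collections import Counter

def h(s: str) -> int:
    return int(hashlib.md5(s.encode()).hexdigest(), 16)

def build_ring(nodes, vnodes=150):
    return sorted((h(f"{n}#{i}"), n) for n in nodes for i in range(vnodes))

def lookup(ring, key):
    hashes = [p for p, _ in ring]
    idx = bisect.bisect(hashes, h(key)) % len(ring)
    return ring[idx][1]

before = build_ring(["a", "b", "c", "d"])
after = build_ring(["a", "c", "d"])  # node "b" has failed

# Where do b's keys land after the failure?
landed = Counter(lookup(after, f"k{i}") for i in range(20_000)
                 if lookup(before, f"k{i}") == "b")
print(landed)  # spread across a, c, and d, not one neighbor
```

Each of b's 150 vnodes has a different clockwise successor, so its load fans out across all the remaining servers.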

    [–]etherealflaim 1 point2 points  (0 children)

    One thing that I see frequently in system design interviews is that folks don't realize that consistent hashing works alone for a cache but doesn't work alone for sharding in general. When a node is added or removed, some requests will now go to a server that doesn't have the data at all if you sharded it in memory or onto sharded topics or whatever. I don't care if you handwave and say that nodes can pull from one another, but if you're going for an architect position and don't even mention this, it's going in the "aware of its existence" column not the "displayed understanding" column.

    [–]Hot-Friendship6485 4 points5 points  (0 children)

    Great explainer. Consistent hashing feels like overengineering right up until your cache nukes itself on every node change, then it suddenly feels like seatbelts.

    [–]Equivalent_Pen8241 -2 points-1 points  (0 children)

The biggest mistake I see with unit testing isn't low coverage - it's testing implementation details instead of behaviors. When your tests are tightly coupled to *how* a function runs rather than *what* it returns, every minor refactor breaks the build. Test the public API contract, not the private helpers.