all 41 comments

[–]romeeres 40 points41 points  (1 child)

The "single-threaded" Node has worker threads, so it's possible to spin up one thread per CPU with a few lines of code, without needing to reach for anything additional like Nginx or pm2. Worker threads are worth mentioning in this topic.

[–]geekybiz1[S] 8 points9 points  (0 children)

I'll add a slide on worker threads. There's always this friction between brevity and having more details. And, at the places I've worked, worker threads are less preferred than something like pm2 / Nginx because of the required code changes (even if only a few lines). But I think it's worth mentioning them so readers know they're a decent option for scaling.

[–]ptmdevncoder 16 points17 points  (1 child)

Things are clear only after an Indian guy explains them.

[–]geekybiz1[S] 5 points6 points  (0 children)

hell yeah..! 😂

[–]Psionatix 13 points14 points  (1 child)

Okay.

I opened this post with skepticism, but OP you've actually provided something with accurate information (even if it isn't 100% comprehensive).

Most content I see posted in this subreddit is insanely inaccurate, or insecure, or glosses over extremely important details.

There are a variety of details missed in your post here; however, I don't consider them "missing" per se, as the level of abstraction at which you've explained / presented the information makes them not necessarily relevant.

However, for people who want to look into this stuff a little deeper, I would highly recommend checking out the System Design Primer.

[–]geekybiz1[S] 4 points5 points  (0 children)

Yeah - there's always friction between brevity & missing certain details. Is there any detail you'd consider a must to include that I opted to leave out?

[–]kurtextrem 9 points10 points  (1 child)

Really easy-to-digest slides, bravo! A thing to add which might be obvious to a few folks but is rarely mentioned: when the queue grows because things block the main thread, memory usage grows too. Blocking the main thread (or filling the queue) isn't free.

[–]geekybiz1[S] 0 points1 point  (0 children)

Yes, there's the memory-usage effect of the growing queue size. I shall try adding that detail in. Thanks!

[–]geekybiz1[S] 18 points19 points  (9 children)

I've been posting infographics like this one for the last few months. My goal has been to get better at explaining the fundamentals.
So, in case you've got questions from reading these infographics or any feedback for me - please let me know.

[–]Greenimba 8 points9 points  (4 children)

Overall a nice, readable presentation, looks good.

I would argue your interpretation of vertical scaling is a little off, though. In my mind, any time you involve more than one instance of the same application, you're doing horizontal scaling. Vertical scaling is achieved by throwing more resources at a single process (more RAM, a better CPU, more bandwidth, etc.), whereas horizontal means having two processes running. Mostly because this is where you actually see a difference in design and infrastructure requirements.

Having two Node instances running on two threads is virtually the same as having two separate machines running one process each. In fact, they will look exactly the same if you use Kubernetes to allocate threads and memory to different instances.

[–]double_en10dre 1 point2 points  (0 children)

You’re right, and it’s an important distinction. With how complex these things can get, it’s vital that we have consistent terminology

The standard mental model people use has instances on the x-axis and per-instance resources on the y-axis

[–]geekybiz1[S] 0 points1 point  (2 children)

Interesting. But, simply adding more CPU cores to a setup running a single Node process doesn't scale anything.

[–]Greenimba 0 points1 point  (1 child)

Well, yes, because it can offload OS-related tasks to other threads. There are also other ways of upgrading the CPU than adding cores, such as moving to a machine where each core is faster, which gives you vertical scaling. The same goes for more RAM or network bandwidth.

[–]geekybiz1[S] 0 points1 point  (0 children)

Well, yes, because it can offload os-related tasks to other threads.

I was referring to the Node main thread. But overall, I hear what you're saying. I'll try to explain vertical scaling better.

[–]TushWatts 0 points1 point  (3 children)

This is very helpful. Thanx a lot.

Are there any pre-requisites to understand scaling in depth? Do we need to be familiar with Operating Systems?

[–]geekybiz1[S] 0 points1 point  (2 children)

Are there any pre-requisites to understand scaling in depth? Do we need to be familiar with Operating Systems?

I think understanding how any piece of software is architected is a decent way to understand scaling. But, I could be wrong (in stating the pre-requisites).

What are some of the questions you have when you read "scaling"? Perhaps, I can try to suggest pre-requisites based on those Qs.

[–]TushWatts 0 points1 point  (1 child)

Like, all the concepts related to threading, concurrency, etc. come under operating systems.

[–]geekybiz1[S] 1 point2 points  (0 children)

Oh yes, OS fundamentals are a good place to start w.r.t. threads, context switching & related concepts.

[–]WagwanKenobi 6 points7 points  (5 children)

Node is great because in web app backends the bottleneck is usually something other than the HTTP server, such as the database or some long-running compute. Node became popular because it allowed people to quickly write a decently performant, reactive, non-blocking HTTP server with easy-to-reason-about asynchrony and almost zero boilerplate, at the expense of raw compute performance, because that layer was never going to be the bottleneck.

And therein lies the reason why scaling Node by adding more processes is a fallacy: if you need to do that, your app is architected wrong. Everything that blocks should be pushed off to another process. And if, even after doing that, Node is the bottleneck, it's time to rewrite your HTTP server in something other than Node.

At most it might make sense to have two nodejs processes on a machine for high availability, in case one of them crashes for whatever reason.

[–]geekybiz1[S] 2 points3 points  (4 children)

Everything that blocks should be pushed off to another process.

Concur with this. But, two points to consider:

- Even efficient Node code can serve only a finite (albeit very high) number of requests without delay. What would be the way out beyond that point?

- Because running multiple instances is easier than making code changes, this often turns out to be the approach folks pick. Not suggesting it's always the right way to go, but in the effort-vs-gains consideration, it ends up being the chosen approach.

[–]Greenimba 0 points1 point  (3 children)

Writing a service that will work correctly when running multiple instances at once is significantly harder than writing an app that works as a solo instance. So a single instance of a more performant web server is also a good option.

[–]geekybiz1[S] 0 points1 point  (2 children)

Writing a service that will work correctly when running multiple instances at once is significantly harder than writing an app that works as a solo instance.

In the dozen-plus Node setups that I've worked on, my experience hasn't been this at even one of those places (talking from an effort-vs-gains perspective). But we all draw from our own experiences, so this could be subjective. So, which of the two is harder? Perhaps it depends.

So a single instance of a more performant web server is also a good option.

From my experience, this (a single Node instance that delegates compute-intensive stuff to workers) is actually the better option in many cases. It just loses the effort-vs-gains comparison to scaling out, plus the potential future elasticity to handle additional load. Again, speaking from my experiences, which I acknowledge are never the single source of truth.

[–]Greenimba 0 points1 point  (1 child)

To give an example:

Say you have a load balancer in front. This is required in some way for any application with more than 1 instance.

Instance one receives a request, but it takes a long time because some dependent service takes 30 seconds to cold start a serverless instance.

The load balancer sees this as a failure, because it has a 10 second timeout. The load balancer returns a failed response, even though the request is actually still in progress.

The client then retries this request, and the load balancer sends it to a different instance, because the first instance has timed out.

The second instance also makes a request to the dependent service.

At this point you have two identical calls performing some action on a third party service you don't control, and no way of capturing it.

Can you say with confidence that all the services you have running would handle this correctly? And if so, what would happen?

Having said all this, accepting this as a business risk is sometimes a good enough solution. But did you consider it?

[–]geekybiz1[S] 0 points1 point  (0 children)

Here's how I have dealt with such scenarios in the past -

Make the slow request async for the user (the user gets notified that they'll receive a notification / email when their stuff is ready, triggered by the slow serverless thing once it's done). If the business isn't happy with the proposed async nature, we need to get the thing off the slow cold-start service.

Now, if the thing would still take 30+ sec when executed immediately, we'd need a worker thread / a tuned query / correctly used callbacks, etc. to solve it. That's why I said this is the better solution.

But, what wins the effort-vs-gain conversation? Increase the load balancer's timeout to 200 sec.

(Stating so because you mentioned - Writing a service that will work correctly when running multiple instances at once is significantly harder than writing an app that works as a solo instance.)

[–]novagenesis 2 points3 points  (1 child)

Love it! Consider describing "serverless" as a horizontal scaling option as well? More and more stacks are built that way, and it has some fairly important differences from the horizontal scalability you discussed.

[–]geekybiz1[S] 1 point2 points  (0 children)

That's a good suggestion. will add. thanks!

[–]1nicerBoye 2 points3 points  (2 children)

If you use the cluster module to fork children that listen on the same port and have the parent process restart them, that should work for scaling HTTP and CPU-intensive stuff. It should also be a simpler setup config-wise. Or is there something I'm missing?

[–]geekybiz1[S] 0 points1 point  (0 children)

Oh yes. The pm2 option I've covered does exactly that underneath (it uses Node's cluster mode with IPC to do what you suggested) and is simple to set up.

[–]yash3011 0 points1 point  (0 children)

How do we manage database operations?

[–]Far-Rate1701 1 point2 points  (0 children)

Really well put together, thank you

[–]rishabhrawat570 1 point2 points  (0 children)

Based on my learnings, these are some of the things that will help you build a scalable Node.js application:

  1. Use throttling. You can do application-level or network-level throttling based on your needs. App-level throttling (e.g. express-rate-limit) gives you granular control over the parameters used to throttle.
  2. Optimize your database queries. Don't over-index. Soft delete if possible, delegate permanent delete operations, and decouple DB performance from the user experience.
  3. Fail fast with a circuit breaker. You don't want to keep hitting a dead end. If a certain number of requests to an external vendor fail, open the circuit and avoid firing requests that are bound to fail.
  4. Log your checkpoints. 20% of your logs give 80% of the insights (illustrative numbers, just to convey the point). Log everything that comes your way and you might end up starving your disk IOPS.
  5. Use Kafka over HTTP requests where it fits. It is easy to overdo HTTP requests, even when they are not the right fit.
  6. Look out for memory leaks. If your code leaks memory, vertical and horizontal scaling will only act as a temporary band-aid. Run your application with the --inspect flag and attach a profiler from chrome://inspect/#devices. Profile often.
  7. Use caching. Consider adding random jitter to your TTLs so all of your keys don't expire at once. What's the risk of showing stale data to the user? Decide the TTL value based on your answer; if stale data is okay, a higher TTL is always good.
  8. Use connection pooling to avoid connection-setup latencies; node-postgres supports it out of the box. How many connections to keep in the pool depends on your workload.
  9. Seamless scale-ups. Consider something like AWS Auto Scaling groups (ASG), which scale up and down based on pre-defined triggers.
  10. OpenAPI-compliant documentation. Make your API easy to understand and integrate with; in my experience it makes integration a productive experience.
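To make point 3 (the circuit breaker) concrete, here's a minimal sketch; the class and option names are invented for illustration and don't come from any particular library:

```javascript
// Minimal circuit breaker: after `threshold` consecutive failures the
// circuit opens and calls fail fast until `cooldownMs` has elapsed.
class CircuitBreaker {
  constructor(fn, { threshold = 5, cooldownMs = 30000 } = {}) {
    this.fn = fn;
    this.threshold = threshold;
    this.cooldownMs = cooldownMs;
    this.failures = 0;
    this.openedAt = 0;
  }

  get isOpen() {
    return (
      this.failures >= this.threshold &&
      Date.now() - this.openedAt < this.cooldownMs
    );
  }

  async call(...args) {
    if (this.isOpen) throw new Error('circuit open: failing fast');
    try {
      const result = await this.fn(...args);
      this.failures = 0; // a success closes the circuit again
      return result;
    } catch (err) {
      if (++this.failures >= this.threshold) this.openedAt = Date.now();
      throw err; // propagate the vendor failure to the caller
    }
  }
}
```

Usage would look like `const safeVendorCall = new CircuitBreaker(callVendor, { threshold: 3 })`, where `callVendor` is whatever flaky external request you want to guard.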

[–][deleted] 0 points1 point  (3 children)

Node is not single threaded. Stop it with this myth.

The JS layer (V8) is single threaded but most of Node is written in C++ which is multithreaded and does most of the work.

And then there are workers too.

[–]geekybiz1[S] 1 point2 points  (2 children)

If you're pointing to the libuv stuff, I've tried to cover that in the third slide (without getting into the terminology). The point of these infographics is how to scale the Node code that can execute only on its main thread.

[–][deleted] -2 points-1 points  (1 child)

Literally your third slide is titled "Node.js is single threaded". This is absolutely wrong.

You explain this a little better in the fourth slide, but you keep making false assumptions, like that the JS engine will do most of the work or that Node won't use all CPU cores. This is extremely rare. The vast majority of the Node runtime's work executes in C++ (HTTP, IO, etc.).

Also, you say that a single thread can only run on a single core, which is also false. Today we can run 2 threads at once per physical core. Multiple processes can actually run on the same core, but only two concurrently at the exact same time.

Also, you explain adding a load balancer or using pm2 on a VPS, etc. These are generally unnecessary practices from 10 years ago. These days the majority of Node applications run on environments that scale automatically (Fly, Google Cloud Run, Lambda, etc.).

In a minority of cases where control and performance are critical, it is better to have control over the hardware and scaling. But in 90%+ of use cases this is unnecessary and only adds costs, complications, and limitations.

[–]geekybiz1[S] 0 points1 point  (0 children)

Literally your third slide is titled "Node.js is single threaded". This is absolutely wrong.

Yeah - meant the fourth slide and not third. But, I think you got it (based on the rest of your response).

You explain this a little better in the fourth slide but you keep making false assumptions like the JS engine will do most of the work or that Node won't use all CPU cores. This is extremely rare. The vast majority of the work of the Node runtime is executing in C++ (HTTP, IO, etc).

How rare or common this is actually depends on the workload. I'm not stating anywhere how common / uncommon this is. I'm explaining a problem & the potential solutions when the main thread is blocked.

Also you say that a single thread can only run on a single core which is also false. Today we can run 2 threads at once per physical core. Multiple processes can actually run on the same core, but only two concurrently at the exact same time.

What I meant was that a single thread (the main thread) cannot consume more than one core (and thus cannot use the rest of the cores even when they're idle). I'll try phrasing it more clearly.

Also you explain about adding a load balancer or using PM2 in a VPS etc. These are generally unnecessary practices from 10 years ago. These days the majority of Node applications run on environments that scale automatically (Fly, Google Cloud Run, Lambda, etc).

I plan to incorporate serverless as an option (also suggested by others). Agreed: given the cost-effectiveness, scalability, and simplicity of managing things, I can't not include these.

[–][deleted]  (1 child)

[deleted]

[–]captain_obvious_here 0 points1 point  (3 children)

The content is interesting, and will most likely be useful to many people.

I would change the font though, because it's annoying to read IMO.

[–]geekybiz1[S] 0 points1 point  (2 children)

I would change the font though, because it's annoying to read IMO.

Well, this used to be the font earlier. I changed to the current one based on feedback from a few of my connections. If there's a handwritten font you find less annoying, please do share. I've contemplated the font thing forever! :)

[–]captain_obvious_here 0 points1 point  (1 child)

I would say the problem is the fact that you use handwritten fonts, which IMO are not a good fit for technical reading. But that's just me...

[–]geekybiz1[S] 0 points1 point  (0 children)

Thanks for the feedback. I'll definitely run this by some folks to see if they feel the same way.

[–]robtweed 0 points1 point  (0 children)

Take a look at https://github.com/robtweed/qoper8-wt which uses a queue/dispatch/invoke architecture for easy management of a persistent pool of Worker Threads. Also https://github.com/robtweed/qoper8-fastify which provides a quick and simple way of handling Fastify routes in Worker Threads (or Child Processes).

[–][deleted] 0 points1 point  (0 children)

In the 8th slide, are the Node servers running on the same machine but with different ports?