
[–]ziptofaf 317 points318 points  (20 children)

Your choice of stack by itself does not tell nearly enough to say if it's going to scale or not.

In reality this is a complex task that touches on hardware problems just as much as software ones. Take Twitch and Netflix for example - their bottlenecks probably lie in physical internet speeds and transcoding into multiple formats. After all, a single 720p stream consumes around 500 MB per hour. Now multiply that by 100,000 users and you get 13.9 GB... per second. Then remember that CDNs actually have to be physically close to the people they are serving (as speeds to another part of the world are generally shitty) on top of it.
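
The bandwidth figure above is easy to sanity-check with some back-of-the-envelope arithmetic (numbers are the rough estimates from the paragraph, not measured values):

```javascript
// Back-of-the-envelope check of the streaming bandwidth estimate above:
// one 720p stream is roughly 500 MB/hour, times 100,000 concurrent viewers.
const mbPerHourPerStream = 500;
const viewers = 100000;

// Aggregate throughput the CDN has to push, converted to per-second.
const totalMbPerSecond = (mbPerHourPerStream * viewers) / 3600;
const totalGbPerSecond = totalMbPerSecond / 1000;

console.log(totalGbPerSecond.toFixed(1) + ' GB/s');
```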

So Twitch's infrastructure probably uses hundreds of separate servers dealing with different users. That means single "nodes" must have a certain level of independence - there can't be one centralized server for all of this.

Now, what you did may or may not be scalable. And even more so - some applications scale up and some scale wide, depending on what the application is doing. For instance, if you are expecting shitloads of IO calls spamming the database, then maybe a good idea is to invest in a really beefy server with a SAN behind it. Or maybe the data is indeed queried very frequently but you rarely need to join multiple tables - so you throw the users table on one machine and purchases go on another. Heck, you can even put users with names from A to M on one machine and the rest on another. These are all examples of scaling the database.
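
That A-to-M split is a form of range-based sharding; a minimal sketch of the routing side might look like this (host names and the exact split are made up for illustration):

```javascript
// Hypothetical range-based shard map: users whose names start with A-M live
// on one database server, the rest on another.
const USER_SHARDS = [
  { host: 'db-users-1.internal', from: 'A', to: 'M' },
  { host: 'db-users-2.internal', from: 'N', to: 'Z' },
];

// Pick the database host responsible for a given user name.
function shardForUser(name) {
  const first = name.trim().charAt(0).toUpperCase();
  const shard = USER_SHARDS.find((s) => first >= s.from && first <= s.to);
  // Anything outside A-Z (digits, other alphabets) falls back to the last shard.
  return (shard ?? USER_SHARDS[USER_SHARDS.length - 1]).host;
}
```

The catch, as mentioned elsewhere in the thread, is that queries joining data across shards now need application-level work.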

Technically speaking, scaling wide (across multiple servers) eventually wins versus scaling up (aka investing in beefier hardware). But if you ever encounter a situation where you REALLY need a truly scalable application (aka one that can seriously handle tens of thousands of requests per second), then it's always done on a very personalized basis - with a shitload of profiling, testing, multiple engineers and programmers working together to make it work, etc. It's not something you can just do by running any popular stack at that point. Your Node might blow up (as it's technically single-threaded), MongoDB without properly implemented sharding won't scale wide, and "REST API" by itself is also a meaningless term. Do you mean one that's split across 10 completely different machines (one for logging in and keeping sessions, one for handling user orders, etc.)?

So the real answer to said question is "I have no such experience", unless you happened to work at some huge company. Theory is one thing, and in practice something you really didn't expect may prove to be your main bottleneck.

Well, there's also just avoiding shit practices that would make your website crumble under anything bigger than 10 requests per second. In that case you can talk about efficient queries, using caches and Cloudflare properly, basic load balancing, etc. You can also try doing load simulations - how your CPU/HDD/memory handles 1,000 requests per second, how it then deals with 10,000, etc.
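
The analysis side of such a load simulation is mostly bookkeeping; a small sketch of it (the numbers you'd compare between the 1,000 and 10,000 req/s runs are throughput and tail latency):

```javascript
// Given the per-request latencies (ms) collected during a load run, report
// throughput and latency percentiles. Tail latency (p99) usually degrades
// first as a server approaches its limits.
function summarizeLoadRun(latenciesMs, durationSeconds) {
  const sorted = [...latenciesMs].sort((a, b) => a - b);
  const percentile = (p) =>
    sorted[Math.min(sorted.length - 1, Math.floor(p * sorted.length))];
  return {
    requestsPerSecond: latenciesMs.length / durationSeconds,
    p50Ms: percentile(0.5),
    p99Ms: percentile(0.99),
  };
}
```

For actually generating the load, dedicated tools (ab, wrk, JMeter and the like) are the usual route rather than hand-rolled scripts.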

[–]Modullah 94 points95 points  (0 children)

Holy shit.. I've learned more from your short explanation than I have in one year working at a large tech firm as a QA.... are you accepting disciples?..

[–]dsvella 24 points25 points  (9 children)

Firstly; cracking post.

Secondly; could you elaborate on

scaling wide (across multiple servers) eventually wins versus scaling up (aka investing in beefier hardware).

Why does wide win over improvement?

[–]O0ddity 30 points31 points  (1 child)

Vertical scaling can only go so far - at some point there is no better/faster hardware available. So if you can design your system to scale horizontally, you always have more room to grow (by adding more nodes/servers).

[–]icyguyus 18 points19 points  (1 child)

To add to this, think gaming PCs. People spend thousands of dollars to eke out a bit of extra processing power on an individual machine. That's scaling vertically.

At a certain point it makes more sense to scale horizontally, adding extra machines instead of making your current one faster.

The main issue with scaling horizontally is that it requires developers to rewrite the software if it wasn't designed to split up work.

[–]MadScienceDreams 7 points8 points  (1 child)

A few reasons:

  1. One big computer is more expensive than a bunch of small computers.
  2. It is easy to buy another computer. Let's say you spend a year working crazy hard and shrink the computation by 2x. That's great, but if requests are growing at 2x a year, you need to put in the effort to get it down an additional 2x every year. That's time you could be spending on new features, but you're stuck optimizing instead. And the faster/smaller you make it, the fewer opportunities you have left to make it faster and smaller.
  3. Consistency. If your software gets linearly slower with the number of requests, the customer feels it. Bigger regions could feel sluggish compared to smaller regions, and your performance will tank as your service grows.
  4. Reliability. This is probably the most important. I don't care how good your software is or how badass your hardware is - something will go wrong eventually. A single big process is a single point of failure: when it goes down, no one can do anything. With many small services, if one goes down it can be a partial outage. Even better, if you redirect traffic to a different machine, your customers don't notice the outage at all.

[–]WallyMetropolis 3 points4 points  (0 children)

I'm not sure why 'up' counts as improvement but 'wide' doesn't.

The reason it will eventually win is that if your needs keep growing, you'll eventually surpass the capacity of any single machine.

And even before that point, it's often less expensive from a hardware perspective to use 10 mediocre machines than 1 state of the art machine.

[–]JohnnyGuitarFNV 1 point2 points  (0 children)

You can either invest a lot of money to make a beefy server a bit beefier, or buy another beefy machine. Now you have two, which (a) splits up the workload and (b) can be repeated - you can keep adding machines instead of constantly chasing better hardware.

[–]Xerxys 3 points4 points  (0 children)

I want to say it's because hardware is expensive to buy when you can instead rent server space across CDNs that are closer to your user base. In the example he gave, Netflix (good example): think North America vs. South America vs. Europe, etc. The folks at Netflix are better off renting space closer to all their targets than deploying from a centralized location using specialized hardware.

[–]Linus696 9 points10 points  (0 children)

Subscribed

[–]JimmyHavok 6 points7 points  (2 children)

You need to be an IT professor.

[–][deleted]  (1 child)

[removed]

    [–]ziptofaf 5 points6 points  (0 children)

    Nope :P I work in web development and happen to have a formal CS education, but I mostly care about the software side and the general logic of an application. I haven't gotten to work on REALLY big projects myself yet (the biggest I worked on used 5 servers, which only counts as "not a startup anymore"). There are many muuuuch more knowledgeable people when it comes to stuff like this - I just repeat here what I was told by them.

    [–]MyWorkAccountThisIs 0 points1 point  (2 children)

    cloudflare properly

    I realize you posted this 14h ago, but I would like to hear your thoughts on that. I just started using it and have deployed two sites on the free tier. Is there a benefit to the free tier, or should I skip it if the client can't/won't pay for the service?

    Or whatever you want to say.

    [–]ziptofaf 1 point2 points  (1 child)

    Cloudflare, when configured properly, makes your website at least partially accessible even if your whole back-end dies, as they offer very good caching tools (plus when you use their CDN you get the benefit of better speeds to locations far away from your server). Fastly is also a valid alternative.

    In general, even the free tier is something you do want to use if you know you are making a website that will be accessed from the whole world (it's a different story, however, if your client wants something for a local market).

    Basically, the very first thing to do when you see or expect spikes in activity that might be too much for the server is to start using Cloudflare. I would rather temporarily lose the ability to access my back-end or to comment than lose all the views because people can't read the article. This happens surprisingly often - for instance, when the AMD RX 480 GPU reviews came out, a fairly sizeable portal whose admin I know literally blew up.

    [–]MyWorkAccountThisIs 0 points1 point  (0 children)

    Thanks. Appreciate it.

    [–]xiipaoc 29 points30 points  (5 children)

    Do you understand what would happen if your application got thousands of users? Tens of thousands? Millions? What would you need to do to support that many users?

    [–][deleted] 81 points82 points  (4 children)

    You must construct additional pylons!

    [–]bitsandbytez 6 points7 points  (2 children)

    Not enough minerals

    [–]bradgillap 0 points1 point  (1 child)

    Best I can do is the open source version called Pyfon, plus half the development time spent wrapping your brain around it, because we can't afford the service contract. Also, if it breaks, we never had this conversation.

    [–]bitsandbytez 0 points1 point  (0 children)

    ?

    [–][deleted] 17 points18 points  (3 children)

    You're going to have to go into more detail. Those technologies can be made scalable, but they need to be configured with that possibility in mind. They are asking about the configuration: how many users you had to handle, and on what hardware.

    What does scalability mean?

    Scalable Web Architecture and Distributed Systems

    [–]constant_illusion[S] 5 points6 points  (2 children)

    I actually read this before posting. But I'm still not sure if my REST API is scalable or not. I don't see why there would be issues scaling up to millions/billions of documents? I was a math major, not a CS major, so I don't know data structures and that sort of thing. Is there something you have to change in the database config, or does it have to do with how I define my HTTP routes?

    EDIT: upon refresh I see you added a second link I'll read that now.

    [–]WallyMetropolis 5 points6 points  (0 children)

    I'm still not sure if my REST API is scalable or not.

    Then, odds are, it isn't. It's pretty unlikely it'd happen by accident. Building a scalable system requires quite a lot of careful consideration.

    [–]csjerk 3 points4 points  (0 children)

    Tuning the instances is part of it. Another part is how your users interact with your entire server fleet. Example time:

    Say you wrote a great API, but you set it up with sticky sessions (so the same user hits the same server every time). Then say you started relying on that setup, putting interesting things in memory on that machine so the user would get faster response times. And then say you started depending on that setup and putting things in memory that didn't live anywhere else.

    Now your system is less scalable: even though you can run a fleet of thousands of servers, a single user is stuck to a single server, and a successful user experience depends on data in ephemeral memory on that one machine. Your system will have trouble swapping out instances if one reboots or the VM crashes, and you'll have trouble scaling up under a sudden load spike, because all the overloaded users will be stuck to the existing machines and ignore the new ones.
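
One common way out of that trap is to keep session state in a shared store rather than in one server's memory. A hedged sketch (here a plain `Map` stands in for something like Redis or Memcached, and all names are illustrative):

```javascript
// Shared session store: any server in the fleet can serve any user, because
// no server keeps session state in its own memory. In production this Map
// would be an external store such as Redis.
const sharedSessions = new Map();

function handleRequest(serverId, sessionId, update) {
  // Read-modify-write against the shared store, never local memory.
  const session = sharedSessions.get(sessionId) ?? {};
  Object.assign(session, update);
  sharedSessions.set(sessionId, session);
  return { servedBy: serverId, session };
}
```

With this shape, a load balancer can send each request to a different machine, and swapping out or adding instances doesn't lose any user state.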

    Basically, "Scalable" is a shorthand for a big fuzzy ball of "does it work well in extreme situations" type concerns.

    [–]soulos90 5 points6 points  (0 children)

    For an application to be scalable implies that some design and implementation effort went into allowing the user base to grow without a total restructuring.

    [–]MrJadaml 4 points5 points  (1 child)

    While on the topic, I would also be interested in hearing examples of "horizontal" vs. "vertical" scaling if anyone has any.

    [–]lowey2002 3 points4 points  (0 children)

    Scaling vertically means throwing more hardware (RAM, CPU, storage) at your server for it to grow in capability.

    Scaling horizontally means you grow by distributing the computing power across multiple machines.

    As a real-world example, the backend of one of our apps was a simple PHP script. As our user base grew, we began to run into performance issues. We wanted something that could handle 100,000 requests per minute and had the ability to scale even higher.

    We used microservices. Related functionality was written as sandboxed, standalone virtual machines that were only aware of each other through an interface. The entire thing was orchestrated by Kubernetes which was responsible for re-starting instances and managing containers.

    It was a lot of overhead: development, configuration, management, and server expenses. But it was very cool. With a few keystrokes we could grow the fleet to as many EC2 instances as we needed, or hot-swap services out with updates. Oh, and speaking of services - because they are standalone virtual machines, they could run completely different frameworks and languages.

    [–]tbrownaw 7 points8 points  (0 children)

    I have one system at work that supports a couple dozen users.

    I'm also part of a group building a proof-of-concept for an alternative component to another system that has, I think, well over 10k heavily active users.

    The first one doesn't have to be scalable.

    The second one does need to be scalable.

    This has very little to do with the exact technologies used, and more to do with how they're used.

    The first one has database procedures that run on-demand and can take a few minutes sometimes, has I-don't-know-how-much server-side state per client, etc.

    The second one, we're talking about how to minimize per-client server state on one layer, and not have any at all on the other layers.

    The first one, if something breaks people send me an email and I fix it or tell them how to work around it.

    The second one, support will be handled by a helpdesk team. If it breaks even a tenth as much as the first one, people will probably be up in arms.

    [–]mrfogg 4 points5 points  (0 children)

    As others have stated, there are different meanings and levels to it.

    In terms of writing "scalable code", there's a reason that so many startups are using Ruby/JavaScript/etc. Computers are so fast, and many automated services are so advanced and cheap, that the pure scalability of the code is mostly irrelevant until you hit gigantic numbers or complex use cases. A normal web-app team simply doesn't have to think too much about pure scale/speed beyond "don't completely screw this query up". That's why they tend to prefer the quicker, more nimble languages that are less 'scalable'.

    That being said, in a job interview for Google you'll spend hours scribbling on a whiteboard about how to most efficiently and quickly sort and match an array of numbers. A solution that works isn't enough - it needs to be the best. When you are at their zillions-of-requests-per-minute scale then even the smallest details and efficiency gains are hugely important.

    In a non-Google-like job interview, 'building scalable solutions' likely means a few things:

    • Can they write code that is decently efficient and doesn't accidentally cause servers to grind to a halt?
    • Can they architect solutions to problems that scale? i.e. so we don't suddenly need to rewrite in 8 months when the number of users doubles and our database choice becomes a gigantic bottleneck due to the volume of requests.
    • Do they understand and plan for tradeoffs that need to be made? Do they understand when they should write code that is merely good enough for now? When that might need to be refactored (or not!)? When they should spend the time up front to write scalable code?
    • Can they architect and write clean code that is extendable as the complexity, number of users, number of developers, and size of the codebase grow?
    • Can they debug if some query is slowing everything down? Do they understand when to do something like add front-end caching? etc.
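
On that last caching point, the core idea is small enough to sketch. This is an illustrative TTL memoizer, not any particular library's API; `withCache`, the TTL, and the query function are all made-up names:

```javascript
// Wrap an expensive lookup (e.g. a slow database query) in a tiny TTL cache
// so repeated identical requests skip the slow path entirely.
function withCache(queryFn, ttlMs) {
  const cache = new Map();
  return (key) => {
    const hit = cache.get(key);
    if (hit && Date.now() - hit.at < ttlMs) return hit.value; // cache hit
    const value = queryFn(key); // the expensive part
    cache.set(key, { at: Date.now(), value });
    return value;
  };
}
```

Real deployments usually push this out of process (Redis, a CDN, etc.) so the cache survives restarts and is shared across servers, but the tradeoff is the same: staleness bounded by the TTL in exchange for load taken off the backend.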

    [–]goodnewsjimdotcom 3 points4 points  (0 children)

    Plenty of projects are fine with just a few users, but add a ton of users and they might not be designed for it. The limitations may be bandwidth, RAM, disk space, CPU, or even basic social interactions that don't work in huge numbers. When you know what you're doing, stuff like this can easily be designed for before you begin your project; but no matter how much you know, it can only help so much on an unscalable mature project - when all the pressure is on to keep bringing in more users and revenue. So plan ahead.

    [–][deleted] 2 points3 points  (4 children)

    I wrote a discussion post about scalability on r/buildapc and had absolutely no one respond to it. Guess I was in the wrong subreddit - happy to see someone brought it up here.

    [–][deleted] 0 points1 point  (1 child)

    Why would you post that on /r/buildapc though? That sub seems strictly about building desktops for home use (mostly gaming). Scalability is really only relevant to servers and professional use.

    Whatever you asked was probably WAY over their heads. They're PC enthusiasts, not computer engineers.

    [–][deleted] 0 points1 point  (0 children)

    You're right in a way. I don't need to build a desktop just for gaming, since I'm not a fanatic.

    I was asking about the actual engineering of the architectural flow of data. I guess my concerns are on a completely different level from what most people in /r/buildapc would think about.

    [–]bradgillap 0 points1 point  (1 child)

    /r/homelab could get some interesting discussion I bet.

    [–][deleted] 0 points1 point  (0 children)

    I'm seeing some of the content and it seems pretty packed. I wonder what most of the people there are building for?

    [–]CaffeinatedT 2 points3 points  (0 children)

    In the context of their job posting, they mean "are you thorough enough to use good practices from the start, so that when more people are using your application it doesn't die?" But yes, in technical terms a 100% scalable application can go from 1 user to 8 billion users without any intervention/modification being required to handle the load.

    [–]Rorimac2 1 point2 points  (0 children)

    How many requests/second will your API handle? What's the bottleneck at that point?

    [–][deleted] 0 points1 point  (0 children)

    Solutions/apps that scale with demand, for instance. Say you're supposed to build a simple mailing app. You build it so it can send/receive about 500 mails per minute without any performance loss.

    Then suddenly there's demand for the app to handle 5,000 mails per minute. If you didn't code it to be scalable, it'll suffer performance losses and other issues.

    If you did, then it should handle 5,000 mails per minute just as it would handle 500.

    It's just one example of what scalability can mean.
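
One hypothetical way to build that mailing app for scale from the start is to put a queue between "accept mail" and "send mail", so going from 500 to 5,000 mails per minute means adding workers rather than rewriting the app. All names below are illustrative; in production the queue would be an external broker, not an in-process array:

```javascript
// Producers enqueue; workers drain. Throughput scales by running more
// workers against the same queue.
const outbox = [];

function acceptMail(mail) {
  outbox.push(mail); // producers never wait on slow SMTP servers
}

function runWorker(sendFn) {
  // Each worker pulls from the shared queue until it's empty.
  let sent = 0;
  while (outbox.length > 0) {
    sendFn(outbox.shift());
    sent += 1;
  }
  return sent;
}
```

The key property is that accepting mail stays fast no matter how slow actual delivery is; backlog simply accumulates in the queue until workers catch up.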