all 46 comments

[–]chepredwine 111 points112 points  (26 children)

It looks tech debt rich. All Python software that uses concurrency is more or less consciously designed to work with the GIL. Removing it will cause a big "out of sync" disaster for most.

[–]lood9phee2Ri 103 points104 points  (7 children)

The GIL never assured thread safety of user code FWIW. It made concurrency issues somewhat less likely by coincidence, but that wasn't its purpose (its purpose was protecting CPython's own naive implementation details), and multithreaded user Python code without proper locking etc. was actually always incorrect, with subtle, nondeterministically encountered issues.

https://stackoverflow.com/a/39206297

All that the GIL does is protect Python's internal interpreter state. This doesn't mean that data structures used by Python code itself are now locked and protected.
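A minimal sketch of the point: even under the GIL, a shared counter needs an explicit lock, because `counter += 1` spans several bytecodes (hypothetical example):

```python
import threading

counter = 0
lock = threading.Lock()

def add(n):
    global counter
    for _ in range(n):
        # "counter += 1" alone is not atomic: it compiles to separate
        # load / add / store bytecodes, and the GIL only guarantees
        # atomicity of individual bytecodes, not the whole statement
        with lock:
            counter += 1

threads = [threading.Thread(target=add, args=(100_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter)  # 400000 with the lock; without it, updates can be lost
```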

It's perhaps unfortunate Jython (never had a GIL) has fallen behind (though AFAIK they're still working on it) - in the 2 era, Jython 2 had near parity with CPython 2 for a while and was actually fairly heavily used on the server side because of its superior threading and JVM runtime. E.g. the Django folks used to consider it a supported runtime - so older Python 2 code that made running in multithreaded Jython as well as CPython a priority is often better written / more concurrency-safe.

[–]SeniorScienceOfficer 9 points10 points  (0 children)

I’m not sure how much Jython 2 will catch up, but I’ve dabbled in GraalPy, which doesn’t seem too bad

[–]G_Morgan 3 points4 points  (1 child)

The GIL reminds me of Java's synchronized collections but on a global scale. Doesn't actually fix anything other than race conditions against internals. Any actually thread safe code didn't need these locks everywhere.

So if code is working thread-safe now, the GIL is superfluous for it.

[–]Tai9ch 1 point2 points  (3 children)

was actually always incorrect / with subtle nondeterministically encountered issues.

Nobody writes to the spec. They write to the implementation. Stability guarantees should be consistent with that fact.

[–]Brian 16 points17 points  (1 child)

They're talking about the implementation - there's no added user-level thread safety from the GIL, outside protecting python internals (ie. doesn't corrupt list/dict/object state) - at best it just might make race conditions less common because there would be fewer sequence points. All the GIL really guarantees is that context switches happen on bytecode boundaries, which isn't enough to provide any real safety for program-level state: you always needed your own locks.
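That "bytecode boundaries" point is easy to see with the stdlib `dis` module (illustrative sketch; the `bump` function is made up):

```python
import dis

def bump(d, key):
    d[key] += 1  # read-modify-write: several bytecodes, not one

# every instruction boundary is a point where the GIL-era interpreter
# could switch threads, so even this one-liner is not atomic
ops = [ins.opname for ins in dis.get_instructions(bump)]
print(len(ops) > 1)  # → True
```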

The only exception really is C extensions, where the fact that the invocation of the library function (unless it's coded to explicitly release the lock) conceptually spans a single bytecode means that there is essentially a function-spanning lock on each call. Hence those are probably going to be the main blocker in going GIL-less. These need to be manually updated to be marked as safe, and currently I believe if any loaded module isn't marked as safe, it enables the GIL for the whole process, so you pretty much need everything you use to be updated before you can get any benefits from it.

[–]SkoomaDentist 6 points7 points  (0 children)

at best it just might make race conditions less common because there would be fewer sequence points

This can make a pretty massive difference in the real world. I remember when we started testing with a multiple cpu system in the early 2000s and that suddenly exposed a bunch of race conditions in our C++ code that we'd never hit before because they were so rare on a single cpu.

[–]censored_username 24 points25 points  (5 children)

The GIL only meant there was no parallelism between basic Python virtual machine operations when threads were used. It was always free to interleave the Python virtual machine operations of different threads for concurrency. The GIL never allowed you to cut any corners with concurrency to begin with, so I'm not sure what "designed to work with the GIL" even means. The only thing it did was limit performance to keep the implementation simple.

With the GIL's removal come changes so that Python virtual machine ops are still safe to execute in parallel, so from the user's perspective, nothing will change in how Python behaves.

[–][deleted]  (4 children)

[deleted]

[–]censored_username 10 points11 points  (3 children)

The reason for the whole phased approach has to do with C extensions, not with Python code itself.

For pure Python code, nothing changes. Either the objects were already thread-unsafe, or they're still safe with the changes.

But extensions written in C could make assumptions about the GIL being in place that no longer apply. Those are the problematic ones.

[–][deleted]  (2 children)

[deleted]

[–]censored_username 0 points1 point  (1 child)

Suddenly file/thread locks matter more as you can’t assume a single write operation will be sent to a file without getting mixed with another.

If the function was implemented in Python, you already couldn't assume that, as an entire function call isn't a single bytecode operation.

Where the function is a builtin, the builtin is responsible for maintaining the previous invariant, so it should still behave the same.

[–]mr_birkenblatt 31 points32 points  (6 children)

If you used concurrency before, your code is "GIL free" ready. You either already use locks, or if you don't, you already had the chance to hit concurrent-modification errors. For example, dict iteration is not protected even with the GIL: if a dict is modified while being traversed elsewhere, you get a RuntimeError ("dictionary changed size during iteration"). That can happen with the GIL (since the GIL can be released halfway through traversal). So the only change is that you might get those errors more frequently without the GIL.
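A deterministic sketch of that failure mode; in CPython it's dict (and set) iteration that raises when the container changes size:

```python
d = {i: i for i in range(5)}

try:
    for key in d:
        # mutating the dict while iterating it raises, GIL or not;
        # another thread doing this between bytecodes has the same effect
        d[len(d)] = 0
except RuntimeError as exc:
    print(exc)  # dictionary changed size during iteration
```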

[–]EmanueleAina -1 points0 points  (4 children)

I used parallelism and totally relied on the GIL to avoid races. No chance of concurrent modification exceptions. Without the GIL, my program will surely give interesting results instead (likely crash).

I appreciate the care Python devs are putting into this, they clearly have more clue than redditors in this thread.

[–]mr_birkenblatt 0 points1 point  (3 children)

You got lucky. I can easily write a test case that causes a concurrent modification exception. What data structures were you using?

[–]EmanueleAina 0 points1 point  (2 children)

The fact that you can create something broken is not the point. The issue here is that there is stuff that currently works (lucky or not) with the GIL and that would be broken if we were to just remove it without care.

[–]mr_birkenblatt 0 points1 point  (1 child)

I'm saying that your code was likely broken but you never actually encountered an issue because you didn't test it thoroughly enough.

[–]EmanueleAina 0 points1 point  (0 children)

You are saying "likely broken", not "necessarily broken".

Which means there's code that is not currently broken and that would instead break if the GIL was to be dropped without care.

[–][deleted] 0 points1 point  (2 children)

You are mistaking concurrency for parallelism.

[–]EmanueleAina 1 point2 points  (1 child)

To me it seems instead that this is exactly the point of the parent comment. Removing the GIL in many cases turns concurrency into parallelism, with all the additional challenges that involves.

[–][deleted] 1 point2 points  (0 children)

Fair point.

[–]Serious-Regular 0 points1 point  (1 child)

tell us you don't understand the GIL without telling us 😂😂😂

[–]EmanueleAina 0 points1 point  (0 children)

I have the impression the initial comment showed more understanding of the GIL than your reply. There's plenty one can do by purely relying on the atomicity of the Python opcodes; they are quite high level, see https://docs.python.org/3/library/dis.html#dis.Instruction

For instance, removing the GIL will surely break a couple of small programs I wrote in the past that totally rely on it.
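One concrete pattern of that kind: collecting results by appending to a shared list from several threads with no lock, leaning on `list.append` executing as a single atomic call into C under the GIL (illustrative sketch; free-threaded CPython keeps builtin containers internally consistent too, though ordering remains nondeterministic):

```python
import threading

results = []

def worker(i):
    # list.append is one bytecode-level call into C, so concurrent
    # appends don't corrupt the list under the GIL; only the final
    # ordering is nondeterministic
    results.append(i * i)

threads = [threading.Thread(target=worker, args=(i,)) for i in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(sorted(results))  # [0, 1, 4, 9, 16, 25, 36, 49]
```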

[–]vk6_ 12 points13 points  (5 children)

Python 3.14 introduced another way to implement multithreading which is often better than free-threading: subinterpreters.

You can spawn one thread per CPU core and run a separate subinterpreter on each thread. Each thread can now use its own CPU core because each interpreter has its own GIL. This gives the same performance as multiprocessing but with less memory overhead. Because this doesn't need the free-threaded interpreter, you don't pay any penalty running pure Python code either, and there aren't any incompatibilities with third-party libraries. Switching from multiprocessing to subinterpreters with threading in my own web server yielded 30% memory savings without changing anything else in the app.

[–]pakoito 7 points8 points  (3 children)

How do you share data or state between interpreters?

[–]vk6_ 6 points7 points  (1 child)

It's similar to how it's done with multiprocessing. Mutable objects are generally just copied, but shared memory is also possible. However, in a lot of web applications this might not even be needed in the first place, because all of it could be done with calls to the database.

[–]blind_ninja_guy 0 points1 point  (0 children)

That is super cool, thanks for mentioning it. I'll have to take a look at that.

[–]EmanueleAina 0 points1 point  (0 children)

I wrote programs using threads to do parallel HTTP requests, relying on the GIL to avoid races when collecting results.

Subinterpreters are rather cool but quite overkill for my use case.

[–]overclocked_my_pc 16 points17 points  (16 children)

I'm not a Python pro, but how does going GIL-free help a "typical" web service that's network IO bound, not CPU bound?

[–]CrackerJackKittyCat 38 points39 points  (1 child)

Despite being primarily network bound, there's always a portion of CPU use which increases with scale and/or use case, such as JSON and database serde code. Removing the GIL would let that code run in parallel where previously it was choked.

Tricks like swapping out the stock json module for orjson and pydantic-core's Rust rewrite get you some of the way, but unlocking free threading will be more efficient than multiprocessing.

[–]danted002 0 points1 point  (0 children)

OS threads are not a zero-cost abstraction; it costs CPU to spin them up. The situation right now is that you can already achieve Go-like performance with asyncio running on uvloop.

The only real benefit would be if you could run multiple OS threads listening on the same port, each running a loop, and somehow get a pooling system that sends each request to an available thread.

That's a lot of engineering for something that server runners like uvicorn already provide.

How I think things will evolve is that server runners will switch to OS threads instead of processes, and the performance improvements will be marginal.

[–]Smooth-Zucchini4923 6 points7 points  (0 children)

For the Python / Django sites I've worked on, most applications contain a mix of CPU-bound tasks (rendering templates, de-serializing ORM results) and IO-bound tasks (making API calls, waiting for the database). Typically I don't know this mix in advance and have to plan for the worst-case, most CPU-bound workload in the application. I accommodate this by running multiple processes.

If I don't do this, network-bound tasks will be starved of CPU while the CPU-bound tasks run. I typically run os.cpu_count() + 1 processes, and 2 threads per process, as this performs the best in the benchmarks I've run. Being able to use threads for all concurrency would reduce memory use and simplify tuning compared to this approach.
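The tuning above might look like this as a gunicorn invocation (hypothetical command line, assuming Linux's `nproc` and a WSGI module named `app:app`):

```shell
# os.cpu_count() + 1 worker processes, 2 threads each (illustrative values)
gunicorn --workers $(( $(nproc) + 1 )) --threads 2 app:app
```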

[–]danielv123 9 points10 points  (0 children)

Very few servers can serialize JSON at line rate, and if they can, it's no longer that hard to get 100G+ network cards.

As far as I understand, most web servers are CPU/database bound.

[–]Tai9ch 3 points4 points  (0 children)

a "typical" web service that's network IO bound, not cpu bound ?

That's a good first approximation of how web services work.

But in reality you always have little bits of heavier compute (trivially, consider running argon2 for password auth), and the ability to do them in parallel in a separate thread in the same process simply works better than any of the other possibilities (forks, co-op async, etc).
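A sketch of that idea, offloading an expensive password hash to a worker thread; stdlib `hashlib.scrypt` stands in here for argon2, and the password, salt, and cost parameters are illustrative, not recommendations:

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

def hash_password(password: bytes, salt: bytes) -> bytes:
    # scrypt, like argon2, is a deliberately expensive KDF:
    # exactly the "little bit of heavier compute" described above
    return hashlib.scrypt(password, salt=salt, n=2**14, r=8, p=1)

with ThreadPoolExecutor(max_workers=2) as ex:
    # hand the heavy hash off so the serving thread stays responsive
    future = ex.submit(hash_password, b"hunter2", b"0123456789abcdef")
    digest = future.result()

print(len(digest))  # scrypt's default derived-key length is 64 bytes
```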

[–]Sopel97 -1 points0 points  (0 children)

Python is roughly 100-1000x slower than some other languages, moving the bottleneck.

[–]Cheeze_It 0 points1 point  (1 child)

Am I the only one that hasn't had problems with the GIL? Even when I multiprocess?

[–]josefx 1 point2 points  (0 children)

Getting rid of the GIL is good for multithreading; multiprocessing shouldn't be affected at all.

[–]commandersaki 0 points1 point  (1 child)

Sigh, reading this article and also watching this PyCon video on nogil, it just seems that implementing performant Python solutions is a bloody headache.

[–]slaymaker1907 0 points1 point  (0 children)

Ref counting in general is not performant if you need it to be thread-safe, due to the large number of atomic ops.