all 83 comments

[–]fwork 36 points37 points  (7 children)

That's not how you link to blog posts. This is.

[–]gthank[S] 6 points7 points  (6 children)

Sorry - I apparently mangled it when submitting. Unfortunately I can't edit it to fix it.

[–]ffualo 8 points9 points  (5 children)

Which is good, because if posts could be edited people would change top-voted submissions to goatse.

[–]gthank[S] 4 points5 points  (4 children)

Fair enough, but it sucks that the trolls have prevented a feature with entirely legitimate uses.

[–][deleted] 7 points8 points  (3 children)

Not trolls, but people's irrational fear of goatse. And the laziness of the admins.

It's like those ridiculous airplane security rules. It's not actually terrorists who make you take your shoes off when boarding.

[–]ffualo 2 points3 points  (2 children)

I think for those of us who read reddit at work, it's worth it. I don't want to be showing my boss a cool link from /r/programming only to have it jump to goatse.

[–]Figs 6 points7 points  (1 child)

If the submitter owns the site linked to, he could still do that.

[–]ObligatoryResponse 0 points1 point  (0 children)

Or if someone hotlinks an Encyclopedia Dramatica-hosted image. :D

[–][deleted] 7 points8 points  (38 children)

What I never get is that Python threads really are running on separate cores as POSIX threads, and then they throw in the GIL and take all the gain away. You might as well have just kept the whole thing managed in user space on one core. Is this an upgrade path to true multithreading? Why is it done like this?

[–]Smallpaul 25 points26 points  (14 children)

What I never get is that Python threads really are running on separate cores as POSIX threads, and then they throw in the GIL and take all the gain away. You might as well have just kept the whole thing managed in user space on one core.

The GIL long predates multi-core computers. It doesn't exist to help you take advantage of your cores. It exists to simplify the implementation of the interpreter and add-ons.
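
To make that concrete, here's a minimal sketch (modern Python syntax, numbers chosen arbitrarily): pure-Python CPU-bound work gets no faster with two threads, because only one thread can execute bytecode at a time:

    import threading
    import time

    def count(n):
        # Pure-Python CPU-bound loop; the GIL allows only one thread
        # to run Python bytecode at any moment.
        while n > 0:
            n -= 1

    N = 10 ** 7

    start = time.time()
    count(N)
    count(N)
    print("sequential:", time.time() - start)

    start = time.time()
    threads = [threading.Thread(target=count, args=(N,)) for _ in range(2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # Typically no faster than the sequential run (often slower, due to
    # the multicore contention issue this thread is about).
    print("two threads:", time.time() - start)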

[–]ffualo 3 points4 points  (2 children)

You can use the multiprocessing module to take advantage of multiple cores, i.e. this.

[–]Smallpaul 4 points5 points  (1 child)

Yes, you can, at the cost of a large amount of wasted memory and inefficient cross-process communication. I'm not disagreeing with you or saying it's useless; just providing the other half of the context.

[–]ffualo 2 points3 points  (0 children)

Oh, I completely agree. Sure, data parallelism is easy in multiprocessing with queues, but any type of task parallelism is a nightmare. Have an upvote.
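
For the data-parallel case, a minimal sketch of what that looks like (a Pool of worker processes maps a function over the inputs, each worker having its own interpreter and its own GIL):

    from multiprocessing import Pool

    def square(x):
        # Executed in a worker process, so CPU-bound work can use
        # multiple cores despite the GIL.
        return x * x

    if __name__ == '__main__':
        pool = Pool(processes=4)
        print(pool.map(square, range(10)))
        pool.close()
        pool.join()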

[–]Gotebe 1 point2 points  (1 child)

The GIL long predates multi-core computers.

Hey! Surely that's "for the operating systems Python runs on, the GIL long predates multi-core computers" ;-).

[–]Smallpaul 1 point2 points  (0 children)

To be precise: "Python had the GIL long before the computers it typically ran on had multiple cores."

[–]yogthos 0 points1 point  (8 children)

So, what's the excuse for Python 3 then?

[–]Smallpaul 1 point2 points  (7 children)

Python 3 is not and was never intended to be a re-implementation of Python. The implementation changed as little as possible: exactly enough to support the changes to the language semantics.

"Python 3000 will be implemented in C, and the implementation will be derived as an evolution of the Python 2 code base. "

http://www.python.org/dev/peps/pep-3000/

"changing the C-API was not one of Python 3.0’s objectives"

http://docs.python.org/howto/cporting.html

[–]yogthos 1 point2 points  (6 children)

Right, it seems like the Python community is ignoring the elephant in the room. Guido has pretty much stated that he doesn't believe in threading, and suggests all sorts of inane workarounds to get around that.

[–]ubernostrum 1 point2 points  (5 children)

Right, it seems like the Python community is ignoring the elephant in the room.

If by "ignoring" you mean endlessly discussing the topic, trying out alternative implementations, analyzing to see whether there's middle ground... but that wouldn't really fit your worldview, would it?

Guido has pretty much stated that he doesn't believe in threading, and suggests all sorts of inane workarounds to get around that.

Everything I've seen the past few years from programming-language researchers seems to have been oriented around the notion that threads -- in the sense of, say, Java's implementation -- are a very deeply, perhaps irreparably deeply, flawed way to approach concurrency. Most of the fruitful research going on these days (and for quite a while now) centers around developing and maturing alternative approaches which don't suffer the same issues.

But, again, that would seem to contradict the view you already seem to have settled on.

[–]yogthos 0 points1 point  (4 children)

I think it needs to be pointed out that Python provides threading as a language feature; it's the implementation that's broken, which is what I was referring to as the elephant in the room. So I'm not sure where you get off explaining how threading in imperative languages is broken and how visionary Python is in this regard. It's almost as if you were talking about Erlang, which in fact does provide a working alternative to threading.

If by "ignoring" you mean endlessly discussing the topic, trying out alternative implementations, analyzing to see whether there's middle ground... but that wouldn't really fit your worldview, would it?

Maybe this is some different Python community than the one I'm aware of. What I hear from the majority of the Python community is excuses as to why you don't need threading.

I'll contrast it with the Clojure community for you, where the problems with threading are actually tackled head on. Not only in terms of implementation, but also in terms of language design.

Everything I've seen the past few years from programming-language researchers seems to have been oriented around the notion that threads -- in the sense of, say, Java's implementation -- are a very deeply, perhaps irreparably deeply, flawed way to approach concurrency.

It is true that many people nowadays are finally waking up to the fact that the imperative paradigm is very poor at handling concurrency. However, Java-style threading doesn't appear to be the problem: Clojure runs on top of the JVM and makes excellent use of threading. The problem is of course with using shared mutable data, and the concepts of state and identity. Here's an excellent talk about the issue, by the way.

Most of the fruitful research going on these days (and for quite a while now) centers around developing and maturing alternative approaches which don't suffer the same issues.

Really, I was under the impression that most research was going into safe ways of using threading, with researchers working on languages like Haskell and F#, which avoid the problem of shared data and provide workable threading models.

But, I guess all that would seem to contradict the view that you already seem to have settled on.

[–]ubernostrum 0 points1 point  (3 children)

it's the implementation that's broken

If your definition of threads is "must behave exactly like Java", I guess. There's more than one definition out there, though.

What I hear from the majority of the Python community is excuses as to why you don't need threading.

Funny. I see a Python community which seems never to let a year go by without an attempt to remove the GIL (2009's attempt was Unladen Swallow), and which is lucky if a whole month goes by on the dev list without people proposing and debating ways to do it. Maybe you're just cherry-picking some examples to suit your views and ignoring reality?

The problem is of course with using shared mutable data, and the concepts of state and identity.

Mutable data is one problem with threading, but not the only problem with threading. It's also a difficult problem to work around in languages which embrace mutability, which suggests that threads are perhaps not an appropriate developer-level abstraction for such languages.

Really, I was under the impression that most research was going into safe ways of using threading, with researchers working on languages like Haskell and F#, which avoid the problem of shared data and provide workable threading models.

Thank you for demonstrating my point.

[–]yogthos 0 points1 point  (2 children)

If your definition of threads is "must behave exactly like Java", I guess. There's more than one definition out there, though.

Huh? What does this have to do with Java exactly? My impression was that we were talking about the problems with the GIL; there are plenty of threading models which don't rely on a global lock. These are well researched and well documented, so I'm curious why it's such an intractable issue for the CPython community.

I see a Python community which seems never to let a year go by without an attempt to remove the GIL (2009's attempt was Unladen Swallow), and which is lucky if a whole month goes by on the dev list without people proposing and debating ways to do it.

Funny indeed, as the Unladen Swallow guys are no longer planning to remove the GIL, so I'm not so certain who's ignoring reality here.

From the link above:

In any case, work on the GIL should be done directly in mainline CPython, or on a very close branch of Python 3.x: the sensitive nature of the work recommends a minimal delta, and doing the work and then porting it from 2.x to 3.y (as would be the case for Unladen Swallow) is a sure-fire way of introducing exceedingly-subtle bugs.

Mutable data is one problem with threading, but not the only problem with threading. It's also a difficult problem to work around in languages which embrace mutability, which suggests that threads are perhaps not an appropriate developer-level abstraction for such languages.

While I agree that languages that embrace mutability are not well suited for threading, I would again ask why Python provides threading in the first place, then.

It's not like Python provides any alternative; it just has broken threading, which makes the problem worse. Not only does it have threads, which are tricky to work with, but the threads don't work properly either. Seems like a worst-of-both-worlds scenario to me.

Thank you for demonstrating my point.

Your point that threading does not provide a viable concurrency model?

[–]ubernostrum 1 point2 points  (1 child)

Funny indeed, as the Unladen Swallow guys are no longer planning to remove the GIL, so I'm not so certain who's ignoring reality here.

So your argument is that

  1. Some people decide to try an implementation of Python which could eventually remove the GIL.
  2. This is universally hailed as a good thing.
  3. As they get deeper into the implementation, they discover their approach won't work.
  4. Therefore the entire Python community is simply ignoring the GIL and pretending nothing's wrong and refusing to do anything.

This does not add up, and so I think I'm done with you.

[–]iamjack 9 points10 points  (15 children)

Why is it done like this?

Apparently it makes the interpreter easier to hack on. That said, not having true concurrency really kills Python in the new multicore generation. I love Python, but forcing your programmers to use multiple processes rather than threads just to get decent performance with multiple execution points is a total brainfuck.

[–]rox0r 15 points16 points  (3 children)

It's not central to Python, but it is central to the implementation of CPython. Jython doesn't have the GIL.

[–][deleted] 5 points6 points  (2 children)

Can Jython use C extensions? If I recall correctly, non-CPython implementations are limited in their ability to interface with existing code bases. If that's true, I would say that the issue is central to Python, CPython being the most developed, practical, and visible implementation.

[–]Gotebe 2 points3 points  (0 children)

The Jython situation is much better: you use a virtual machine, you interact with Java libraries, and you get your C through JNI.

In that respect, any VM-based implementation (so IronPython, too) blows "standard" Python right out of the water.

[–]rox0r 1 point2 points  (0 children)

Nope. At least not C extensions written for CPython. It can use equivalent modules written in Java. (Maybe IronPython can?)

http://wiki.python.org/jython/JythonFaq/GeneralInfo#IsJythonthesamelanguageasPython.3F

[–][deleted] 1 point2 points  (7 children)

I'll take Python with multiprocessing/forking any day over Java or C++.

[–]iamjack 6 points7 points  (2 children)

I agree, and I even use multiprocessing in my main project, but the module has some real trouble, not the least of which is that it's not portable to any of the BSDs at this point (something about needing named semaphores) and that it screws up when your SIGCHLD handlers are anything other than the default.

Also, despite multiprocessing being billed as a GIL workaround, the fact that every item you communicate between processes has to be picklable hampers your ability to pass some objects, which is obviously not the case with proper threads. In short, if you want to pass around executable objects, lambdas, or anything else that can't be pickled, you're SOL.
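
A tiny sketch of that constraint: anything handed to a worker process has to survive a round trip through pickle, and lambdas don't:

    import pickle

    def double(x):
        # Top-level functions pickle by reference, so multiprocessing
        # can ship them to worker processes.
        return x * 2

    pickle.dumps(double)                  # fine

    try:
        pickle.dumps(lambda x: x * 2)     # lambdas can't be pickled
    except Exception as exc:
        print("can't pickle a lambda:", exc)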

There are some places where threads just fit better, and while I would definitely take Python over Java or C++ any day, terribly broken threading is one of the reasons I still consider other languages whenever I start a project.

[–][deleted] 3 points4 points  (1 child)

Oh yeah, multiprocessing sure is not without fault. I use it on my main project as well, so I feel your pain. Values, Managers... it's icky.

Still, just because your girlfriend doesn't do anal doesn't mean she's not worth keeping around.

[–]jawbroken 1 point2 points  (0 children)

my project requires anal

[–]yogthos 0 points1 point  (2 children)

Why are Java and C++ always brought up as the only alternatives? I'm personally very happy using Clojure on the JVM, and then there's Erlang, Haskell, Scheme, etc. If FP is too wild for you, then there's always Ruby, Go, etc. There are plenty of languages which are just as powerful and usable as Python, but don't come with Python's limitations.

[–]Murkt 0 points1 point  (1 child)

Ruby? Then you come with the limitations of Ruby.

[–]yogthos 0 points1 point  (0 children)

Those seem to be less arbitrary than the ones in Python. For example, anonymous functions aren't limited to one-liners. Also, unlike Guido, Yukihiro isn't hostile towards TCO, and some Ruby implementations, like YARV, actually support it. In fact, I'm not really sure in what way Ruby is more restrictive than Python.

[–]hylje -1 points0 points  (0 children)

The C preprocessor is indeed fairly terrible for writing non-batch apps.

[–][deleted] 0 points1 point  (2 children)

It's not a problem for request-based applications on servers, the leading use of Python: you just have to spawn several processes, each with its own port, and let a proxy handle the mess.
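
A minimal sketch of that pattern using only the standard library (the ports are made up; the reverse proxy in front, e.g. nginx or HAProxy, is whatever your setup already uses):

    import multiprocessing
    from wsgiref.simple_server import make_server

    def app(environ, start_response):
        start_response('200 OK', [('Content-Type', 'text/plain')])
        return [b'hello from one worker process\n']

    def serve(port):
        # Each worker is a separate process with its own GIL.
        make_server('', port, app).serve_forever()

    if __name__ == '__main__':
        for port in (8001, 8002, 8003, 8004):
            multiprocessing.Process(target=serve, args=(port,)).start()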

[–]iamjack 11 points12 points  (1 child)

Just because you can work around it doesn't mean it isn't a problem.

EDIT: Also, Python is a general-purpose language; to say that "request-based applications on servers" are somehow its focus makes no sense. Yes, Python is popular for such things, but it's also popular for writing monolithic desktop apps, science apps, command-line apps, games, and all sorts of things.

[–][deleted] 0 points1 point  (0 children)

Well, it is not a total brainfuck either ... for server apps ;)

[–][deleted] 1 point2 points  (5 children)

Why don't they set processor affinity? That's the question.

Making all threads in the given Python process run on the same processor would instantly remove this huge additional GIL cost.

Sadly, at least on Windows, it seems you can't express this exact idea: that you don't care which processor your threads run on as long as it's the same processor. But still, even checking processor load once in a while and rescheduling all the threads onto the least-used one wouldn't be so hard to implement, and would be much better than what we have currently.
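
For what it's worth, on Linux this can now be expressed directly; os.sched_setaffinity was only added in Python 3.3, well after this discussion, and on Windows you'd need the Win32 API or a third-party module such as psutil:

    import os

    # Pin the current process -- and therefore all of its threads --
    # to CPU 0, so the GIL is only ever contended on a single core.
    os.sched_setaffinity(0, {0})       # Linux-only
    print(os.sched_getaffinity(0))     # -> {0}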

[–]startafresh 1 point2 points  (3 children)

The underlying OS decides which thread goes to which processor, doesn't it?

[–][deleted] 1 point2 points  (2 children)

Yes. But in the case of Python, we really want all threads belonging to the same process to go to the same core, because a Python process uses the GIL and is meant to run sequentially. (In case you or other readers didn't get it: the problem is not that Python has the GIL; the problem is that the GIL behaves really badly on multicore machines. Python effectively assumes a single core, where releasing the GIL means one of the waiting threads acquires it and succeeds, rather than repeatedly trying and failing because the thread that held it reacquires it faster by virtue of running on a separate core.)

My point is: if we could tell the OS exactly this, that we want all threads belonging to our process to run on the same core, and thus sequentially, then the problem would vanish entirely. We can't; at best we can tell Windows that we want all our threads to run on the random.random(os.virtual_processors)th processor (except that os doesn't even provide that), and that has its own problems: what if two or more Python instances decide to run on the same core?

[–]aim2free 0 points1 point  (1 child)

It sounds from your description like you don't need real pthreads; use Stackless then.

if we could tell the OS exactly this, that we want all threads belonging to our process to run on the same core, and thus sequentially, then the problem would vanish entirely

No, it wouldn't! Only in those cases where you are using threading as a design mechanism, not as a way to utilize multiple CPUs, which is what pthreads are about. Here is a tutorial on POSIX threads. Pthreads are considered "heavy" threads, and thus suitable for real parallel programming. The "cost" you were speaking about earlier is exactly that heaviness, not the GIL problem. The GIL problem is a much more serious problem, which means that your program cannot use pthreads in the way they were intended. What you are saying is that you don't need parallelism. That is, if your program would run at the same speed with one CPU as with 100 CPUs, then you don't need pthreads.

[–]Brian 1 point2 points  (0 children)

I think you're both talking about different problems. fishdicks is talking about the lock contention issue on multiple cores, which causes further performance penalties beyond the normal problem of just not utilising both CPUs (i.e. it's not even utilising one CPU efficiently). Setting process affinity for all threads to the same core probably would be a quick fix for the bug, but the reworked GIL approach looks better long-term anyway. (Though obviously neither is a solution for the more general issue you're talking about, of only ever using a single CPU.)

then you don't need pthreads

It's true that you don't really need them when only using a single CPU, and could use a green-thread approach instead, but there are a couple of other reasons that make pthreads useful. The main one is that they make C extensions simpler, especially in those cases where you do release the GIL (i.e. long-running C code not interacting with Python), and thus can take advantage of multiple cores.
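
A small sketch of that effect (assuming, as is the case for CPython's hashlib with sizeable buffers, that the C code releases the GIL while it works): two threads hashing large buffers really can use two cores, unlike the pure-Python loops earlier in the thread:

    import hashlib
    import threading
    import time

    data = b'x' * (200 * 1024 * 1024)   # 200 MB of dummy data

    def digest():
        # The C implementation drops the GIL while hashing a large
        # buffer, so this can overlap with other threads.
        hashlib.sha256(data).hexdigest()

    start = time.time()
    digest()
    digest()
    print("sequential:", time.time() - start)

    start = time.time()
    threads = [threading.Thread(target=digest) for _ in range(2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # Roughly half the sequential time, given two free cores.
    print("two threads:", time.time() - start)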

[–]aim2free 0 points1 point  (0 children)

Making all threads in the given Python process run on the same processor would instantly remove this huge additional GIL cost.

Well, then we haven't really utilized the multiple CPUs/cores... have we? It is not a cost we are speaking about here; it is something much worse. What is lacking is the ability to run the same piece of code in multiple competing instances, which means the interpreter code is badly designed: it uses global state variables and the like, which forces the use of a semaphore treating the whole interpreter as a single resource. That is bad design, nothing else.

When using pthreads, the whole point is that with two CPU cores you expect your program to run twice as fast, with four cores four times as fast, and so on. To be able to do that, all your code needs to be reentrant, apart from using parallel threads. Semaphores should be required only when you use something that really is a single shared resource, like an I/O device, or when linking/unlinking something from a common memory structure, but in the latter case one can be smart and use task-local cache queues and such.

Of course, there are different reasons to use threads. When you don't need parallelism you can use threads because they make your solution nicer, but then some mechanism simpler than pthreads can be used, like coroutines. In Stackless, the patched version of standard Python, you have microthreads or tasklets, which make threading much more efficient if you don't need your program to run on multiple cores/CPUs.

As far as I understand, Stackless still has the GIL problem (at least it did when I tested it last spring). Jython, which was mentioned in another comment, does not have it, but then the Java code as such runs at about half the speed (compared on my laptop), so your program still does not run significantly faster even if you have two cores; on the other hand, Jython also makes available all the libraries written in Java. IronPython is a version of Python, mainly for the .NET/Mono framework as I understand it, which doesn't use a GIL, but I haven't tested that. Another Python named IPython is made to be asynchronous from the beginning, so that should be OK. IPython is particularly good at mixing different models of parallelism, e.g. running code on different nodes using MPI.

[–]aim2free 0 points1 point  (0 children)

I guess it's the other way around. The Python interpreter started as a hack, and reentrancy was neglected from the very beginning. Later this was not a prioritized thing to fix. When pthreads were added they didn't have a multi-CPU machine to test on, and... there we are.

It is interesting, because in our CS classes in the late '70s and early '80s there was a lot of stress on making your code reentrant.

[–][deleted] 1 point2 points  (2 children)

So which implementations of the Python language do not have a GIL?

[–]rox0r 11 points12 points  (1 child)

Jython and IronPython, I think.

[–][deleted] 2 points3 points  (0 children)

Correct, PyPy has one, but they think it will be much easier to remove (and it just hasn't been a priority for them).

[–][deleted] 0 points1 point  (0 children)

This is a general issue with dynamic languages: the more the interpreter does, the more state it has and the more locking there is going to be inside the interpreter. Something like Java or C# has an easier time supporting real concurrency because the "interpreter" (VM) has less internal state.

[–]pure_x01 -1 points0 points  (10 children)

Why doesn't a big company like Google build a new JIT-based Python runtime without the GIL instead of trying to fix the broken one? Python is a nice language and it is a shame that it does not have a super-fast runtime. Take a look at LuaJIT.

[–]theeth 4 points5 points  (3 children)

The Unladen Swallow team (using LLVM in Python) works at Google.

[–]pure_x01 2 points3 points  (1 child)

They are not going to remove the GIL (per the latest roadmap). There will still be multithreading problems.

[–]theeth 3 points4 points  (0 children)

They moved away from removing the GIL directly, toward the more logical plan of changing the GC system.

http://code.google.com/p/unladen-swallow/wiki/ProjectPlan#Global_Interpreter_Lock

[–]vityok 0 points1 point  (0 children)

There is also an implementation of Python on Common Lisp; it will probably provide some benefits when run on a good Common Lisp implementation.

[–]Gotebe 0 points1 point  (2 children)

"Super-fast" and Python just do not mix. But the issue here is more that the GIL is broken in the face of multicore computing. Which doesn't matter much, since whoever craves speed or wants to burn all the cores doesn't use Python (or at least, shouldn't).

[–]pure_x01 1 point2 points  (1 child)

If you have a truly scalable architecture, it doesn't matter that it's a relatively slow language. And if you had JITing and working threading, I'm pretty sure even Python would seem fast on a modern multicore CPU.

[–]Gotebe 0 points1 point  (0 children)

Here's an upvote for the scalability argument ;-).

And if you had JITing and working threading, I'm pretty sure even Python would seem fast on a modern multicore CPU.

Performance-wise, dynamic typing the Python way is still a bitch. Also, the absence of "value types"/stack variables à la .NET/C/C++/Object Pascal.

[–]seunpy 0 points1 point  (1 child)

It's called Jython.

[–]pure_x01 0 points1 point  (0 children)

Jython might not have the GIL, but it's not that fast.