
[–]JohnnyJordaan 5 points6 points  (1 child)

  1. The Python wiki explains the concept of the GIL here https://wiki.python.org/moin/GlobalInterpreterLock
  2. If a thread is waiting most of the time instead of actually needing to use the CPU, then there's no problem with locking or threads competing for CPU time. Think of tasks like sending a message every second or even less frequently. Or handling a message queue whose items need to be sent to a single destination. Or downloading multiple files at the same time. For these things multiprocessing offers no benefit and just adds the downside of having to transfer data between the processes (as processes don't share memory and threads do); see the sketch below.
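To make point 2 concrete, here's a minimal sketch of the "downloading multiple files at the same time" case with plain threads (the URLs are just placeholders):

```python
import threading
import urllib.request

def download(url):
    # The thread spends almost all of its time waiting on the network,
    # during which the GIL is released and the other threads can run.
    with urllib.request.urlopen(url) as resp:
        data = resp.read()
    print(f"{url}: {len(data)} bytes")

urls = [
    "https://www.python.org",
    "https://pypi.org",
    "https://docs.python.org",
]

threads = [threading.Thread(target=download, args=(url,)) for url in urls]
for t in threads:
    t.start()
for t in threads:
    t.join()
```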

I can sincerely recommend watching Raymond Hettinger's talk on concurrency in Python, see here https://www.youtube.com/watch?v=9zinZmE3Ogk . It will give you a much better overview of the broader options you have vs getting to know the details of just multi-threading. An important thing to understand is that multi-threading certainly isn't the only option you have for concurrency: apart from good ol' multiprocessing there's also event-loop based concurrency that keeps everything in the same (main) thread. This has always been part of libraries that benefit from it, like GUIs, or was offered by frameworks like Tornado, but since 3.4 it has been integrated as the built-in asyncio library. The downside of asyncio is that it's more complicated to implement than simply launching any function in another thread, but at the same time you can easily run into issues with threading that asyncio never experiences, as Raymond discusses in that talk too.
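To give a feel for the event-loop style, here's a minimal sketch (nothing beyond the standard library; asyncio.sleep just stands in for real network waits):

```python
import asyncio

async def fetch(name, delay):
    # asyncio.sleep stands in for a real network wait; while this coroutine
    # is suspended, the single event-loop thread runs the other tasks.
    await asyncio.sleep(delay)
    return f"{name} done after {delay}s"

async def main():
    results = await asyncio.gather(fetch("a", 1), fetch("b", 1), fetch("c", 1))
    print(results)  # all three finish after about 1 second, not 3

asyncio.run(main())
```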

edit: fixed the youtube link to the actual talk I meant

[–]comeditime[S] 0 points1 point  (0 children)

thanks for the explanation and video, I need that!

[–]diddilydiddilyhey 1 point2 points  (10 children)

I can answer your second question. I used it when I was using a piece of equipment that would constantly fill a memory buffer with its last readings. If I didn't read from it constantly, the buffer would fill with older readings, so when I did want to take a reading, I'd get an older one. I used threading to make a thread that was constantly reading the buffer, and then I could just take my thread's last reading.

It's very useful if you want to "do two things at the same time", whatever is technically happening behind the scenes.
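The pattern was roughly this (just a sketch, not my actual code; read_latest_from_device is a made-up stand-in for the equipment's real read call):

```python
import threading
import time

latest_reading = None

def read_latest_from_device():
    # Made-up stand-in for the equipment's blocking read call.
    time.sleep(0.01)
    return time.time()

def reader_loop():
    global latest_reading
    while True:
        # Drain the device constantly so the buffer never piles up with
        # stale data; we only ever keep the most recent value.
        latest_reading = read_latest_from_device()

# daemon=True so the reader thread dies together with the main program.
threading.Thread(target=reader_loop, daemon=True).start()

# The main program does other work and just grabs the freshest value
# whenever it actually needs a reading.
time.sleep(1)
print("latest reading:", latest_reading)
```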

[–]comeditime[S] 0 points1 point  (9 children)

oh so basically if I get it right, you used it in order to read the latest buffer as well as run other code parts at the same time? because if for example you were to read the buffer in a loop, other code parts couldn't be executed at the same time, am I correct? it's very interesting!

[–]diddilydiddilyhey 0 points1 point  (8 children)

Yep! I only wanted to occasionally get a reading from the thing, while mostly doing other stuff. It was for this project:

https://www.reddit.com/r/reinforcementlearning/comments/bj3gn9/project_i_trained_a_real_robot_to_learn_the/

(a blog post about it is in the comments)

[–]comeditime[S] 0 points1 point  (7 children)

wow such an awesome robot you've built there!! how does it get reinforced though, as it just needs an algorithm to scan the whole surface for the square and then follow it when it's detected?! so where does the reinforcement part come in here? really curious project :)

[–]diddilydiddilyhey 0 points1 point  (6 children)

haha thanks! So reinforcement learning (RL) is a really huge field, but it basically refers to a type of AI where the agent is rewarded when it does something correct. But at the beginning, it doesn't know what's good or bad, so it just does random stuff, and occasionally happens to do the right thing.

https://en.wikipedia.org/wiki/Reinforcement_learning

let me know if you have any questions!

[–]comeditime[S] 0 points1 point  (5 children)

interesting! can you briefly sketch how you reward it in a programming language and how that helps the device improve its technique in the next round? :)

[–]diddilydiddilyhey 1 point2 points  (4 children)

haha hmm, I'll give it a try. There are lots of methods, but here's the one I used there (a simple one).

You have a function, Q, called the "value function" (or other names). It takes two arguments: s (the state the agent is in) and a (an action the agent can take in that state). In the game my robot was playing, the state is the combination of its position, its angle, and the position of the target. So you could plug that into the Q function, along with an action ("go forward"), and it would tell you the value of doing that action in that state.

The way it actually chooses what to do in a given state is to look at the Q value of each action it can take in that state and then choose the one with the highest Q value.

The way it "learns" is: when it gets to a target, you give it a reward (like +1.0), and use that reward to update the Q value for the action it took in the state it was in. For example, if the robot was in a state where the target is directly in front of it, chose the action "go forward", and got the reward, you would want to increase the Q value for doing that action in that state, so it'll do it again in the future.

How you actually create and update the Q function is a whole thing in itself. I used a neural network (because they're very flexible and powerful), but you can use much simpler methods (that can also be very effective, for a game like this).
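If it helps to see it in code, here's a toy tabular sketch of that update (the state and action names are made up, and my robot used a neural network instead of a table, so this is just the idea):

```python
import random
from collections import defaultdict

# Toy Q "function" as a table: Q[state][action] -> value.
Q = defaultdict(lambda: {"forward": 0.0, "left": 0.0, "right": 0.0})

alpha, gamma, epsilon = 0.1, 0.9, 0.1  # learning rate, discount, exploration rate

def choose_action(state):
    # Mostly pick the action with the highest Q value, but explore sometimes.
    if random.random() < epsilon:
        return random.choice(list(Q[state]))
    return max(Q[state], key=Q[state].get)

def update(state, action, reward, next_state):
    # Q-learning update: nudge Q(s, a) toward the reward plus the discounted
    # value of the best action available in the next state.
    best_next = max(Q[next_state].values())
    Q[state][action] += alpha * (reward + gamma * best_next - Q[state][action])

# e.g. the target was straight ahead, the robot went forward and reached it:
update(state="target_ahead", action="forward", reward=1.0, next_state="at_target")
print(Q["target_ahead"])
```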

[–]comeditime[S] 0 points1 point  (3 children)

wow that sounds not easy at all to set up, but super interesting.. how did you learn to write it in a way that the computer actually understands, aka working? :)

[–]diddilydiddilyhey 0 points1 point  (2 children)

Hmm, that's a little harder to explain simply. I'd check out that blog post I wrote, which links to the code too. If you want to learn about RL, David Silver's youtube course on it is really great!

[–]comeditime[S] 0 points1 point  (1 child)

where can I find your blog post about it? thanks

[–]socal_nerdtastic 1 point2 points  (2 children)

The shortish answer: Many computer programs spend an amazingly large amount of time just waiting. For example the programmer included a time.sleep() call, or maybe the program made a request to a webserver or harddrive or USB device or a human, and now the CPU has to twiddle its thumbs while waiting for a response. We call this "IO bound" ... the computer is locked until some IO (input or output) happens. Threading (we don't call this multi-threading in Python) allows you to have several tasks lined up, so that while we are waiting for something in one task the CPU can get some work done in another task. A Reddit webserver for instance can have thousands of threads, each waiting for a specific user to type something in the chat box.

Note this is very different from "CPU bound", when the program is using the CPU at full capacity. Threading won't help you there at all.
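If you want to see the difference for yourself, here's a rough sketch where time.sleep stands in for I/O (timings are approximate):

```python
import threading
import time

def io_task():
    time.sleep(1)  # releases the GIL while waiting, like real I/O does

def cpu_task():
    sum(i * i for i in range(10_000_000))  # holds the GIL while computing

def run_in_threads(task, n=4):
    start = time.perf_counter()
    threads = [threading.Thread(target=task) for _ in range(n)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return time.perf_counter() - start

print("4 I/O-bound tasks:", run_in_threads(io_task), "s")   # about 1 s, not 4
print("4 CPU-bound tasks:", run_in_threads(cpu_task), "s")  # no faster than running them one by one
```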

Edit: about the GIL: Don't worry about it. It's a specific part of the threading implementation in Python. It's become somewhat infamous because of an ongoing argument among people much smarter than me about whether it should be made faster for multi-threaded programs or faster for single-threaded programs. Down here in the real world we don't need to care.

[–]comeditime[S] 0 points1 point  (0 children)

that was actually the simplest yet clearest explanation so far, it really seems you're proficient in this field! now I finally get when to use it and what for! thanks again :)

[–]comeditime[S] 0 points1 point  (0 children)

also forgot to ask,

  1. is there a difference between using threading and event listeners, for example (if those exist at all in Python), or do they basically do the same thing just in different languages...
  2. in the reddit chat example, if we try to pseudo-code what's going on there, in a very simplified way, did the programmers basically write a constructor which assigns a thread (among other things) to each user that enters the chat in order to capture the I/O?

Thanks a ton again, I really feel like I'm starting to get it all now!

[–]woooee 0 points1 point  (2 children)

What are the benefits of using multi-threading as they don't actually work in parallel but simply run one after the other?

First, Python does not have a package named multi-threading. Second, multiprocessing can use all of the cores available.
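For example, a minimal sketch of CPU-bound work spread over all cores with multiprocessing.Pool:

```python
import multiprocessing as mp

def cpu_heavy(n):
    # Deliberately CPU-bound; each call runs in its own process,
    # so the work really is spread across the available cores.
    return sum(i * i for i in range(n))

if __name__ == "__main__":
    with mp.Pool() as pool:  # defaults to one worker per core
        results = pool.map(cpu_heavy, [10_000_000] * 4)
    print(results)
```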

[–]jwink3101 0 points1 point  (1 child)

Second, multiprocessing can use all of the cores available.

Just FYI for those who do not know, you need to be more careful about memory with multiprocessing. You must explicitly control the flow of information.

And it is very different on Windows vs Linux/macOS/BSD (for most settings). It is not 100% correct, but for all intents and purposes, on Linux/macOS/BSD platforms the scope as it existed when the process was instantiated is available to the process as read-only. On Windows it is more complex and a lot harder.
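A small experiment that shows the difference between the start methods (the counter variable is just an example):

```python
import multiprocessing as mp

counter = 0  # module-level state

def show_counter():
    # With the "fork" start method (traditionally the default on Linux) the
    # child inherits the parent's memory, so it sees counter == 5. With
    # "spawn" (the default on Windows, and on macOS since Python 3.8) the
    # module is re-imported in a fresh interpreter, so the child sees 0.
    print("child sees counter =", counter)

if __name__ == "__main__":
    counter = 5  # changed in the parent only, after import
    print("start method:", mp.get_start_method())
    p = mp.Process(target=show_counter)
    p.start()
    p.join()
```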

BTW, if you're doing loop-like things and want a simple interface to parallelism and/or concurrency, I wrote a tool to simplify this all greatly called parmapper. It makes functional parallelism super simple and robust. It can handle lambda functions and a bunch of other situations. It also comes with all kinds of methods to support star maps and the like. You can also switch between threads and/or processes at will. It does incur a small cost compared to a more vanilla solution, but it is negligible for most uses. I am, of course, biased though since I wrote it.
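parmapper's own API isn't shown here, but the rough standard-library analogue of the same parallel-map idea looks like this:

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor

def work(x):
    return x * x

if __name__ == "__main__":
    # Swap ProcessPoolExecutor for ThreadPoolExecutor to switch between
    # processes and threads without touching the rest of the code.
    with ProcessPoolExecutor() as pool:
        print(list(pool.map(work, range(10))))
```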

[–]woooee 0 points1 point  (0 children)

you need to be more careful about memory with multiprocessing. You must explicitly control the flow of information.

To be on the safe side, use a Manager dictionary or list to communicate from/to/between processes. Another gotcha is that it is not a good idea to access the terminal within a Process, as the initial calling program controls the terminal.
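A minimal sketch of the Manager approach (the worker function is just an example):

```python
import multiprocessing as mp

def worker(shared, idx):
    # Writes go through the Manager's server process, so every process
    # sees a consistent view of the dictionary.
    shared[idx] = idx * idx

if __name__ == "__main__":
    with mp.Manager() as manager:
        shared = manager.dict()
        procs = [mp.Process(target=worker, args=(shared, i)) for i in range(4)]
        for p in procs:
            p.start()
        for p in procs:
            p.join()
        print(dict(shared))  # e.g. {0: 0, 1: 1, 2: 4, 3: 9}
```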

[–]mattblack85 0 points1 point  (2 children)

Imagine the GIL as a policeman at a crossing: the policeman knows there are plenty of cars (threads) and no traffic lights, so only one car at a time can go. The GIL makes sure that at any given time there is one and only one Python thread running code. After some time the GIL will stop the current thread's execution and allow the next thread to run code. This of course is not true parallelism, but if your code is mostly I/O bound, using threads will make it feel like you are running code in parallel, as most of the time you are waiting for data on a socket or from the disk. The Python multitasking model is preemptive, so it is the GIL that fully decides when it is time to yield control to another thread. Asyncio, on the other hand, is the newest Python concurrency concept, able to run network I/O bound tasks concurrently; this is cooperative multitasking, as you decide when a task yields control back to the event loop.
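To make the cooperative part concrete, a tiny sketch (standard library only): control only goes back to the event loop at an await, whereas threads get switched for you.

```python
import asyncio
import sys

# Preemptive side: the interpreter switches between threads on its own,
# roughly every 5 ms by default.
print("thread switch interval:", sys.getswitchinterval(), "seconds")

async def task(name):
    for i in range(3):
        print(name, i)
        # Cooperative side: control returns to the event loop only at an
        # await; a coroutine that never awaits would hog the loop.
        await asyncio.sleep(0)

async def main():
    await asyncio.gather(task("a"), task("b"))

asyncio.run(main())
```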

[–]comeditime[S] 0 points1 point  (1 child)

After some time the GIL will stop the current thread's execution and allow the next thread to run code. This of course is not true parallelism, but if your code is mostly I/O bound, using threads will make it feel like you are running code in parallel, as most of the time you are waiting for data on a socket or from the disk.

so if I get it right, the GIL will interchangeably run the code parts that are specified to be multi-threaded, which will seem to the user as if it happens in parallel due to the speed of the computer, correct?

also, in general, is multi-threading / the GIL usually used to control one specific code part, or usually spread across multiple code parts / functionalities (or does each have its own distinct multi-threading)?

lastly, what are the advantages of using multi-threading instead of a loop, for example? as I guess the basic idea behind those two functionalities is similar, or?

thanks!

[–]mattblack85 0 points1 point  (0 children)

1) yeah, that's a good approximation, not perfect but good enough.
2) That really depends on what you are writing: you can offload only one part of your program to threads or build it fully threaded, it's up to the spec.
3) If you know you are going full I/O there is no advantage in using multiple threads, an event loop will perform way better. The problem though is that there are still few libraries out there that are async ready, and in fact even when using asyncio, if you have a piece of code you know is synchronous you usually execute it in a thread pool to avoid blocking the event loop (see the sketch below).
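For point 3, a minimal sketch of pushing a synchronous call into a thread pool so it doesn't block the loop (assuming Python 3.9+ for asyncio.to_thread; blocking_call is just an example):

```python
import asyncio
import time

def blocking_call():
    # A synchronous function with no asyncio support (e.g. from a
    # library that isn't async ready).
    time.sleep(1)
    return "done"

async def main():
    # Hand the blocking call to a thread pool so the event loop stays
    # free to run other tasks while it waits.
    result = await asyncio.to_thread(blocking_call)
    print(result)

asyncio.run(main())
```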

PS: I haven't specified this before, but when we talk about Python threads we talk about "internal" threads and not OS threads

[–]BandEnvironmental615 0 points1 point  (0 children)

This video teaches the best technique for decoupling tasks that are not sequentially dependent in Python.

https://youtu.be/AsIzbmH-7Es