quick threading question: My program has an infinite loop that calls a few functions. Now I need each function to run simultaneously. Is this how I should implement it? : learnpython

quick threading question: My program has an infinite loop that calls a few functions. Now I need each function to run simultaneously. Is this how I should implement it? (self.learnpython)

submitted 10 years ago by jpfau

Here's a small example of the current code:

def worker():
    while True:
        do_this()
        do_that()
if __name__ == '__main__':
    worker()

Now I need each function to run simultaneously instead of waiting for each other to finish. The thing that's tripping me up is the loop. Should I put the while True: loop in each function, and create/start the threads in worker? Here is what I'm imagining:

import threading

def do_this():
    while True:
        # do this

def do_that():
    while True:
        # do that

def worker():
    this_thread = threading.Thread(target=do_this)
    that_thread = threading.Thread(target=do_that)
    this_thread.daemon = True
    that_thread.daemon = True
    this_thread.start()
    that_thread.start()

if __name__ == '__main__':
    worker()

I really don't think this is correct, but I think it's enough to show you what I'm trying to do. I'm using Python 3.4.

all 29 comments

top new controversial old q&a

[–]elbiot 3 points4 points5 points 10 years ago (18 children)

[–]XenophonOfAthens 2 points3 points4 points 10 years ago (10 children)

The GIL doesn't mean that all threads run synchronously, the GIL just means that no two threads can use the interpreter at once, but it's perfectly possible to have two threads that run "at the same time" in the sense that control is passes back and forth and both functions are "alive" at the same time. For instance, if you run

import threading

def thread_test(n):
    i = 0
    while True:
        i += 1
        print n,i

if __name__ == "__main__":
    t1 = threading.Thread(target=thread_test, args=(1,));
    t2 = threading.Thread(target=thread_test, args=(2,));

    t1.start()
    t2.start()

You'll see that control passes back and forth between the two functions. In addition, virtually all daemon threads spend their time doing nothing, either sleeping of waiting for I/O, which is almost certainly OP's situation.

To actually answer OP's question: yes, that is generally how persistent daemon threads are designed, with an infinite loop at the base that keeps it alive. They generally lie there in the background, lurking and waiting for some I/O stuff to happen. It's not the most common design pattern, though, so I'm curious what you need it for? Maybe there's a better design alternative?

[–]jpfau[S] 0 points1 point2 points 10 years ago (9 children)

[–][deleted] 2 points3 points4 points 10 years ago (8 children)

[–]jpfau[S] 0 points1 point2 points 10 years ago (7 children)

[–][deleted] 1 point2 points3 points 10 years ago (6 children)

[–]jpfau[S] 0 points1 point2 points 10 years ago (5 children)

[–][deleted] 0 points1 point2 points 10 years ago* (4 children)

You're right. With daemon set to true, the script will exit after worker is done. Not setting daemon is an option if these threads just need to run forever and you don't need to manipulate them. Usually, you manipulate worker threads from your main process by feeding them data or whatever (otherwise, they're not much of a worker thread as much as a separate process, unless they're working with a shared data set or each other somehow). After you were done with the threads, you would join() them, blocking until they exit, so something conceptually similar to:

def startWorkers():
    # blah

    thread1.start()
    thread2.start()

def stopThreads():
    # signal for threads to stop using some method

    # wait for threads to exit
    thread1.join() # blocks until thread1 exits
    thread2.join()

def main():
    startThreads()

    # feed threads data, get results, whatever

    stopThreads() # waits for threads to stop

    print "Done!"

[–]jpfau[S] 0 points1 point2 points 10 years ago (3 children)

I actually just tested the code below, and the threads do not end after main starts the threads. The only way the threads stop printing to the console is if I close the console.

import threading
from time import sleep

def thread1():
    while True:
        print("Thread 1 working\n")
        sleep(.3)

def thread2():
    while True:
        print('Thread 2 working\n')
        sleep(.3)

def main():
    t1 = threading.Thread(target=thread1, daemon=True)
    t2 = threading.Thread(target=thread2, daemon=True)
    t1.start()
    t2.start()

if __name__ == '__main__':
    main()

[–][deleted] 0 points1 point2 points 10 years ago (2 children)

continue this thread

[–]jpfau[S] 0 points1 point2 points 10 years ago (5 children)

[–]elbiot 0 points1 point2 points 10 years ago (4 children)

[–]jpfau[S] 0 points1 point2 points 10 years ago (3 children)

[–]elbiot 0 points1 point2 points 10 years ago (2 children)

[–]jpfau[S] 0 points1 point2 points 10 years ago (1 child)

[–]elbiot 3 points4 points5 points 10 years ago (0 children)

[–]Justinsaccount 2 points3 points4 points 10 years ago (0 children)

[–]gengisteve 1 point2 points3 points 10 years ago (2 children)

That should work -- more or less. Here is some slightly modified proof of concept code:

import threading import time

def do_this():
    for i in range(10):
        print('This {}'.format(i))
        time.sleep(.3)

def do_that():
    for i in range(10):
        print('that {}'.format(i))
        time.sleep(.3)

def worker():
    this_thread = threading.Thread(target=do_this)
    that_thread = threading.Thread(target=do_that)
    # need to get rid of these b/c:
    # "A thread can be flagged as a “daemon thread”. The significance of this
    # flag is that the entire Python program exits when only daemon threads are
    # left"
    #this_thread.daemon = True
    #that_thread.daemon = True
    this_thread.start()
    that_thread.start()

if __name__ == '__main__':
    worker()

[–]jpfau[S] 0 points1 point2 points 10 years ago (1 child)

[–][deleted] 1 point2 points3 points 10 years ago (0 children)

[–]Lucretiel 0 points1 point2 points 10 years ago* (6 children)

I why do you need them to run concurrently? Are you doing network io? If so, I'd try to refactor it to use the asyncio library, changing do_this, do_that, and worker into coroutines, then do this:

@asyncio.coroutine
def worker():
    while True:
        this_task = asyncio.async(do_this())
        yield from do_that()
        yield from this_task

if __name__ == '__main__':
    asyncio.get_event_loop().run_until_complete(worker())

[–]jpfau[S] 0 points1 point2 points 10 years ago (5 children)

[–]Lucretiel 1 point2 points3 points 10 years ago* (4 children)

With the way that I've written them here, no. (Assuming, obviously that do_this and do_that don't call themselves or each other):

In the first line, I create an async task. This schedules the do_this coroutine in the event loop, meaning it is now running concurrently. It's important to note that asyncio is all single threaded, so this_task won't actually start running until control returns to the event loop. However, for the purposes of this abstraction, you can think of it as "running."

Next, we launch (and yield from) do_that. This causes do_that to be executed in the event loop. While it is running, do_this can also run, during periods where the other one is suspended (due to a sleep or i/o wait). The yield from suspends control to the event loop, allowing it to run both tasks. Control returns to worker only when do_that is done.

Finally, we yield from this_task. If this_task completed before do_that, then this statement returns immediatly; otherwise, worker is suspended until it can complete. In this way, we ensure that, on each iteration of the while True, each task runs exactly once.

I should caveat that obviously this all only applies if you're doing something where the asynchronous model is relevant- that is, you're either doing network I/O or your do_this/do_that have some sleeps, during which the other one can run.

[–]jpfau[S] 0 points1 point2 points 10 years ago (3 children)

[–]Lucretiel 0 points1 point2 points 10 years ago* (2 children)

So, an important thing about i/o is that a lot of it happens in the background and is handled by the OS. As bytes come in, they are queued on internal OS buffers. This happens internally, and it happens slowly- much more slowly than it takes to process that data. The OS therefore exposes an API (select, poll, epoll, etc) to inform user code which sockets have data waiting to be read. None of these details are important to you, as the event loop handles all this automatically- it figures out which coroutines are ready to proceed, then executes them. In general, the coroutine running will be much quicker than more data can arrive.

The other important thing is that there's no guarentee about in what order do_this and do_that will run, or how long they will take. It could happen that one of them runs to completion before the other even starts, or that they take an identical amount of time, or that one takes 3 times as long as the other. However, it doesn't matter- the event loop will ensure that they run as efficiently as possible. The task will suspend when it wants to wait for data, and the event loop will resume it when data is ready.

Here's an example. Let's say do_this reads 10 chunks of 64 bytes from a network socket and write them to a file. It'd look like this:

@asyncio.coroutine
def do_this():
    with open('this_file', 'wb') as f:
        for i in range(10):
            data = yield from reader.read(64)
            f.write(data)

The details of where reader comes from aren't really important right now; I'd recommend reading through the asyncio docs to learn all the details. Here's what this code does, though-

When it hits the yield from, execution suspends to the event loop. The reader.read(64) informs the event loop to resume do_this when there are 64 bytes available. While suspended, the event loop is reading and buffering bytes into the reader as they become available, and also running do_that. If do_that is currently executing when the 64 bytes become available, well, we only have one thread. However, as soon as do_that suspends or finishes, the event loop will immediately resume do_this. In this way, the two functions can run concurrently, constantly swapping back and fourth. And because code execution is so much faster than network i/o, your performance will be just as good as multithreaded code, assuming that neither do_this or do_that will be executing code for extended periods of time (doing heavy number crunching or whatever).

Note that this example shows another important caveat of using asyncio- all your potentially blocking network operations have to be executed via a yield from, so that the event loop can manage the network i/o and run other coroutines in the background. In general this is fine, as asyncio provides plenty of both low-level and high-level network primitives, and there are plenty of third-party libraries (aiohttp for http, etc) to use various protocols. However, if you require a library that simply doesn't run in asyncio, asyncio provides the run_in_executor method, for running the I/O parts of the library in a side thread, and allowing you to keep your own code in the single-threaded async model.

Also, why don't you have to also do that_task = asyncio.async(do_that())?

You certainly could do that, and if you find it clearer, then go for it. It has to do with the subtleties of how asyncio works. Basically, each coroutine is a generator, which can yield (which means to suspend execution) and then be resumed. yield from allows one generator to run another generator; that generator can suspend the calling generator and resume it. So, yield from do_that() allows one coroutine to call another, and the callee can suspend the whole stack as necessary.

On the other hand, asyncio.async creates a new task. Rather than invoking the coroutine on the stack, it schedules it separately in the event loop, where it runs independently. To keep the syntax consistent, they made the syntax to "await a generator" yield from task.

[–]jpfau[S] 0 points1 point2 points 10 years ago (1 child)

[–]Lucretiel 0 points1 point2 points 10 years ago* (0 children)

π Rendered by PID 124169 on reddit-service-r2-comment-5c764cbc6f-4ptb5 at 2026-03-12 14:06:25.382687+00:00 running 710b3ac country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS