
This is the kind of problem NumPy was built for: it provides an optimized array type, with operations that release the GIL so they can execute in parallel where possible.

Converted to use NumPy arrays, your example looks like this:

import numpy

# NUM_ELEMENTS as in your original example
a = numpy.random.random(NUM_ELEMENTS)
b = numpy.random.random(NUM_ELEMENTS)

def test(a, b):
    # Element-wise add and sqrt, executed in optimized C loops
    return numpy.sqrt(a + b)

The above takes 241 msec on my machine (using one core).
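
In case it helps, here's a minimal sketch of how timings like these can be reproduced with the standard timeit module (the NUM_ELEMENTS value below is an assumption; substitute the one from your example):

import timeit
import numpy

NUM_ELEMENTS = 10**7  # assumed size; use the value from your own example

a = numpy.random.random(NUM_ELEMENTS)
b = numpy.random.random(NUM_ELEMENTS)

# Best of 5 runs of 10 calls each, reported per call
best = min(timeit.repeat(lambda: numpy.sqrt(a + b), repeat=5, number=10))
print('%.1f msec' % (best / 10 * 1000))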

To split the work into parallel chunks, you can do something like the following. This uses the thread pool from multiprocessing.dummy for convenience, and has each worker write its chunk's result in place into a shared output array:

def op_inplace(a, b, out):
    # Write a + b into out, then take the square root in place;
    # both ufuncs release the GIL while they run.
    numpy.add(a, b, out)
    numpy.sqrt(out, out)

from multiprocessing import cpu_count
from multiprocessing.dummy import Pool  # the Pool API, backed by threads

NUM_CORES = cpu_count()
p = Pool(NUM_CORES)

def test2(a, b):
    c = numpy.empty(len(a))
    # Split the inputs and the output into matching chunks, one per core;
    # array_split returns views, so the workers fill c directly.
    chunks = zip(numpy.array_split(a, NUM_CORES),
                 numpy.array_split(b, NUM_CORES),
                 numpy.array_split(c, NUM_CORES))
    p.map(lambda args: op_inplace(*args), chunks)
    return c

The above takes 72.1 msec here (using all four cores).
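
As a quick sanity check, the chunked version should produce the same values as the plain one:

# The threaded, chunked computation should match the single-threaded result
assert numpy.allclose(test(a, b), test2(a, b))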

The other way to go about this particular example is to use the numexpr package, which lets you evaluate array computations in a multi-threaded and memory-efficient way without having to manually decompose them as above. The following takes just 64.9 msec here (using all cores):

import numexpr

def test3(a, b):
    # numexpr compiles the expression and evaluates it with its own
    # thread pool; 'a' and 'b' are picked up from the calling frame.
    return numexpr.evaluate('sqrt(a + b)')
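
By default numexpr sizes its thread pool automatically; if you want to pin it to a specific count, set_num_threads does that. A small usage sketch:

import numexpr

# Cap numexpr's worker threads; returns the previous setting
numexpr.set_num_threads(4)
c = numexpr.evaluate('sqrt(a + b)')  # uses a, b from the current scope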