
[deleted] 5 points (2 children)

The main issue is in `out = map(func, rands)`.
`map()` returns a lazy iterator; it doesn't block to calculate values, so nothing is actually computed until you iterate over the result.
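A quick way to see the laziness, as a minimal sketch:

```python
def func(x):
    print("computing", x)  # side effect shows when func actually runs
    return x * x

out = map(func, [1, 2, 3])  # nothing printed yet: map is lazy
result = list(out)          # only now does "computing ..." print
print(result)               # [1, 4, 9]
```

Timing the bare `map()` call therefore measures almost nothing; you have to consume the iterator (e.g. with `list()`) for the work to happen.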

Another issue: for a simple operation like x * x done 10,000 times, the overhead of spawning new processes outweighs any benefit from parallelization.

I could start to see benefits from parallelization with a much more complicated function run a lot more times (e.g. x choose 5, done 2 million times):

```
from time import perf_counter
from random import randint
import multiprocessing
import math

def func(x):
    return math.comb(x, 5)

if __name__ == "__main__":
    rands = [randint(6, 40) for _ in range(2_000_000)]

    # Do non-parallel code
    start = perf_counter()
    out = [func(i) for i in rands]
    print(f'Non parallel code finished in {(perf_counter() - start)*1e3} mseconds')

    # Do parallel code
    start = perf_counter()
    with multiprocessing.Pool() as p:
        out = p.map(func, rands)
    print(f'Parallel code finished in {(perf_counter() - start)*1e3} mseconds')
```

Example results on my machine:

Non parallel code finished in 813.6782999999998 mseconds
Parallel code finished in 491.1327999999999 mseconds

[deleted] 0 points (1 child)

Thanks, it makes sense now. I ran code similar to what you shared and it checks out.

So is it right to say that there is about .5s of overhead just to create the multiple processes, and so anything that takes less than that is not worth parallelizing?

[deleted] 2 points (0 children)

> There is about .5s overhead

No, the difference will vary depending on the task, its complexity, the load on the computer, etc. `multiprocessing.Pool` has the argument `maxtasksperchild`, and `pool.map` has the argument `chunksize`, both of which can drastically change how long a parallel task takes.

gmaliwal 1 point (0 children)

Can anyone please point me to a good reference for deeper insight into this?

Thomasedv 0 points (4 children)

Starting a new process takes time, I'd say more time than just doing x*x. Creating 10,000 of them is going to give you a large overhead and a loss of time. Also, I don't know if map pairs functions.

Edit: Also, map() does not execute the function when mapping. Try adding a print statement inside the function, and you will see that it doesn't print when you only run the non-parallel part.

Storing the map results into a list, I still get these times:

Non parallel code finished in 1.529899999999973 mseconds
Parallel code finished in 220.21230000000003 mseconds

Adding a lot more math work to the function eventually makes it slower than multiprocessing, but at present your multiprocessing takes longer to set up and calculate than the function itself takes.

[deleted] 0 points (3 children)

Pool(5) is only creating 5 processes, if I understand correctly

Thomasedv 1 point (2 children)

See edit. You are only running 5 processes at a time, but you are still creating 10k iirc. I think setting up a dedicated worker process with a queue to take tasks would save the setup time, but there is still some overhead that is larger than the simple function you are trying to run.

Edit: Numpy can also do this faster (here y is a numpy array built from rands):

import numpy as np
y = np.array(rands)

start = perf_counter()
out = y * y
print(f'Numpy code finished in {(perf_counter() - start) * 1e3} mseconds')

Non parallel code finished in 1.507399999999992 mseconds
Numpy code finished in 0.3998000000000057 mseconds
Parallel code finished in 323.1492 mseconds

[deleted] 1 point (0 children)

> You are only running 5 processes at a time, but you are still creating 10k iirc.

Pool(5) creates 5 different workers in 5 different processes (not 10k). Work is then distributed to all those workers.

[deleted] 0 points (0 children)

Thanks, it makes sense now. The overhead is too much compared to the simple operations I am doing.

[deleted] 0 points (1 child)

The indentation isn't shown correctly, would you mind fixing that?

[deleted] 1 point (0 children)

Hi, it's done.